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OB PROTEIN RECEPTOR AND RELATED COMPOSITIONS AND METHODS 
FIELD OF THE INVENTION 

5 The present invention relates to OB protein 

receptors, related compositions and methods of making 
and using such receptors and related compositions. 

BACKGROUND 

10 Although the molecular basis for obesity is 

largely unknown, the identification of the "OB gene" and 
protein encoded ("OB protein") has shed some light on 
mechanisms the body uses to regulate body fat deposi- 
tion. Zhang et al., Nature 372 : 425-432 (1994); see 

15 also , the Correction at Nature 374 : 479 (1995) . The OB 
protein is active ill vivo in both ob/ob mutant mice 
(mice obese due to a defect in the production of the OB 
gene product) as well as in normal, wild type mice. The 
biological activity manifests itself in, among other 

20 things, weight loss. See generally , Barinaga, "Obese" 

Protein Slims Mice, Science 269 : 475-476 (1995) . See PCT 
International Publication Number WO 96/05309, 
"Modulators of Body Weight, Corresponding Nucleic Acids 
and Proteins, and Diagnostic and Therapeutic Uses 

25 Thereof," herein incorporated by reference. 

The other biological effects of OB protein are 
not well characterized. It is known, for instance, that 
in ob/ob mutant mice, administration of OB protein 
results in a decrease in serum insulin levels, and serum 

30 glucose levels. It is also known that administration of 
OB protein results in a decrease in body fat. This was 
observed in both ob/ob mutant mice, as well as non-obese 
normal mice, Pelleymounter et al.., Science 269 : 540-543 
(1995); Halaas et al., Science 269 : 543-546 (1995). Seg 
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alsa, Campfield et al . , Science 2£2: 546-549 (1995) 
(Peripheral and central administration of microgram 
doses of OB protein reduced food intake and body weight 
of ob/ ob and diet- induced obese mice but not in db/db 
5 obese mice.) In none of these reports have toxicities 
been observed, even at the highest doses. 

Despite the promise of clinical application 
of the OB protein, the mode of action of the OB protein 
in vivo is not clearly elucidated, in part due to the 

10 absence of information on the OB receptor. High affinity 
binding of the OB protein has been detected in the rat 
hypothalamus, reportedly indicating OB receptor loca- 
tion. Stephens et al., Nature 222: 530-532 (1995). The 
db/db mouse displays the identical phenotype as the 

15 ob/ob mouse, extreme obesity and Type II diabetes; 

this phenotype is thought to be due to a defective OB 
receptor, particularly since db/db mice fail to respond 
to OB protein administration. Sej= Stephens et al . , 

20 Identification of the OB protein receptor is 

key in determining the pathway of signal transduction. 
Moreover, identification of the OB protein receptor 
would provide powerful application in diagnostic uses, 
for example, to determine if individuals would benefit 

25 from OB protein therapy. Furthermore, the OB recepto r 
could be a key component in an assay for d etermining — 
' addition al mole ^ulgj^Mhi^i-4^ the receptor and — 

result in desired biological activity. Further, such 
soluble receptor could enhance or alter the effective- 

30 ness of OB protein (or analog or derivative thereof) . 

SUMMARY OF TH F. INVENTION 

The present invention relates to a novel class 
of protein receptors, herein denominated "OB protein 
35 receptors" or "OB receptors", which are thought to 
selectively bind OB protein. As such, the novel OB 
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receptor family is provided, as well as novel members of 
such family. Also provided are nucleic acids, vectors 
and host cells containing such nucleic acids, related 
antisense nucleic acids, molecules which selectively 
5 bind to the OB protein receptor, and related composi- 
tions of matter, such as OB receptor protein/OB protein 
complexes- In other aspects, the present invention 
relates to methods of using the above compositions, such 
as therapeutic and/or diagnostic methods, and methods 
10 for preparing OB receptor ligands. 

DETAILED DESCRIPTION 

A novel family of OB receptors is provided. 
This novel family resulted from identification of a PCR 

15 fragment isolated from a human liver cell cDNA library. 
The original PCR fragment, from which primers were 
isolated, contained a "WSXWS" motif, common to cytokine 
receptors. As illustrated by the working examples 
below, using this fragment four members of this OB 

20 protein receptor family have been identified. These 

members, herein designated as "A", M B", and "C", and W D" 
are indent ical at amino acid position 1-8 91 (using the 
numbering of Seq. ID No. 1), but diverge at position 892 
through the C-terminus. They vary in length at the 

25 C-terminus beyond amino acid 8 91, and the different 
forms appear to have different tissue distribution. 

Using hydrophobicity analysis, the leader 
sequence is likely to comprise amino acids (Seq. ID. 
No. 1) 1-21, 1-22, or 1-28. The first amino acid of the 

30 mature protein is likely to be 22 (F) , 23 (N) or 29 (T) . 
Most likely, based on analysis of eucaryotic cell 
expression (CHO cell expression see Example 8, infra ) . 
the first amino acid of the mature- protein is 22 (F) . 
The beginning of the transmembrane domain appears to be 

35 located at position 840 (A) or 842 (L) . The end of the 
transmembrane domain appears to be located at position 
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10 



15 



20 



25 



862 (i), 863 (S) or 864 (H) . Thus, based on predictions 
from hydrophobicity analysis, for OB protein binding, at 
a minimum what is needed is the extracellular domain of 
the mature protein, amino acids 22, 23 or 29 through 
amino acids 839 (D) or 841 (G) . Therefore, the present 
class of OB receptor proteins includes those having 
amino acids (according to Seq. ID No. 1) : 

(a) 1-896; 
22-896; 



30 



35 



(b) 
(O 
(d) 
(e) 
<f) 
ih) 
(i) 

C)> 
(k) 

(1) 
(m) 
(n) 
. (o) 
(P) 



23-896; 
29-896; 
1-839; 
22-839; 
1-841; 

22- 841; 

23- 841; 
29-841; 
1-891; 

22- 891; 

23- 891; 
29-891; 

the amino acids of subparts (1) 
through (o) having the C-terminal amino acids selected 
from among: 

OB receptor B (Seq. ID No. 3) 



(i) 



positions 892-904; 

(ii) 

positions 892- 958; and, 

(iii) 

positions 892-1165; 

(q) 

d, f, g, i, j, k, n, o 



OB receptor C (Seq. ID No. 5) 

OB receptor D (Seq. ID No . 7) 

amino acids of subparts b, c, 
and any of (p) lacking a 



leader sequence, which have an N-terminal methionyl 
residue. 

Also provided herein is what is thought to be 
a human splice variant of a soluble OB receptor. This 
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splice variant includes the extracellular domain at 
least up to amino acid 798 (of Seq. ID No, 1, for 
example) and has a unique 6 amino acid C-terminus at 
positions 799-804: G K F T I L. 
5 The functional domains of the OB receptor may 

be predicted using the information contained in Bazan et 
al., PNAS-USA fil: 6934-6938 (1990) (incorporated herein 
by reference) . For the present OB receptor, there are 
two hematopoietin domains, a random coil region, the 
10 transmembrane domain, and the intracellular domain. The 
overall geography may be illustrated as follows: 




RC/ 



Box 1 Box2 Box3 
HI H2 TM 

IC 



Using the information provided by Bazan, 
15 supra , the domains may be predicted, with essentially an 

error of approximately plus or minus three base pairs 
(as applied to all amino acid location specified for 
purposes of identifying the Bazan predicted domains) . 
The precise locations may be determined empirically by 

20 methods known in the art, such as preparing and 

expressing modified recombinant DNAs. The structural 
characteristics are though to be important for 
maintaining the structural integrity of the molecule, 
and therefore, to the extent that such structure is 

25 important for function, for functional characteristics 
as well. 

The hematopoietin domains (HI and H2) are 
thought to have two fibronectin type 3 repeats each, one 
set of paired cysteine residues each (thought to form a 
30 disulfide bridge) , and one "WSXWS box" (referring to the 
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single letter amino acid abbreviation, with "X" being 
any amino acid) . The f ibrinectin type 3 domains may be 
identified by location of a double proline ("PP") , which 
marks the beginning of the second fibronectin type 3 
5 repeat; the actual beginning of such second fibronectin 
type 3 repeat is likely to begin about 3 amino acids 
upstream of that double proline. 

The first hematopoietin domain is likely to 
begin at amino acid 123 (using the numbering according 
10 to Seq. ID No. 1, for example), which is an isoleucine 
residue (I). The last amino acid of the hematopoietin 
domain is likely to be amino acid 339, which is a lysine 
(K) residue. The two fibronectin type 3 repeats are 
likely to be located at (about) amino acids 123 through 
15 235 and 236 through 339. There is a single pair of 

cysteine residues which likely form a disulfide bridge, 
located at position 131 and position 142. The "WSXWS 
box" is located at position 319 through 323. 

The second hematopoietin domain is likely to 
20 begin at position 428, which is an isoleucine (I) and 

end at position 642 which is a glycine (G) . The paired 
fibronectin type 3 repeats are located at about position 
428 through position 535 and about position 536 through 
about position 642. One pair of cysteines is located at 
25 position 436 and position 447, and the second pair is 
located at position 473 and 488. The "WSXWS box" is 
located at position 622-626, 

Between the first and the second hematopoietin 
domain (amino acids 339-428, approximately) is a region 
30 of unknown functional significance. 

The random coil domain ("RC" between the H2 
and the transmembrane domain, "TM") is likely to begin 
at the amino acid following the end of the second 
hematopoietin domain, and is likely to end at the 
35 beginning of the transmembrane domain. This is likely 
to be from about amino acid 642 through amino acid 839 
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or 841 (with the transmembrane domain beginning at 
position 840 (A) or 842 (L) ) . The intracellular domain 
PIC") is likely to begin at position 861 (L) , 862 (I) , 
863 (S) or 864 (H) . 
5 The intracellular domain ("IC" ) contains three 

regions, or "boxes, M thought to participate in signal 
transduction (two "JAK" boxes and a single "STAT" box, 
"Box 1", "Box 2", and "Box 3") . With respect to the 
numbering of the amino acid positions of the "D" form of 

10 the OB receptor (Seq. ID No. 7, below), box 1 is located 
at amino acid 871 (F) through 878 (P) . Box 2 is located 
at approximately amino acid number 521 (I) through 
931 (K) . Box 3 on the M D" form is located at approxi- 
mately position 1141 through 1144 (amino acids YMPQ, as 

15 the "STAT" box is typically a conserved region of "YXXQ" 
wherein "X" designates any amino acid) . The intracel- 
lular domain is thought to be responsible for signal 
transduction. One possible mode of action is via 
phosphorylation of various residues. See Ihle et al., 

20 Cell M: 331-334 (1996) (Review article, herein 
incorporated by reference . ) 

One possible mode of action is that upon 
ligand binding (here, OB protein binding) , the OB 
receptor dimerizes with another receptor. A kinase 

25 ("JAK") binds to box 1, and becomes phosphorylated. 

(The JAK may already be bound prior to dimerization. ) 
Also, "STATS" bind to box 3 and become phosphorylated on 
a specific tyrosine. It is thought that this 
phosphorylation results, probably indirectly, in DNA 

30 binding protein production, which results in altered DNA 
transcription, and therefore altered expression. As seen 
below in Example 6, one measurement of the capability of 
an OB receptor to transduce signal is the degree of 
phosphorylation of JAK/STAT molecules. 

35 The C-terminus region is intracellular (of 

cell-bound OB receptor) . The differences in the C- 
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terminus among members of the present OB receptor family 
may result in differences in signal transduction among 
the species- Thus, the present OB receptors include at 
least the extracellular do main whic h is important for OB 
'5 protein ligand binding. Nucleic acids encoding the 

present OB receptors, vectors, and host cells are also 
provided for herein- 

The extracellular domain may be modified and 
still retain the function of ligand binding, 
10 particularly by one or more of the following 

modifications: (a) the random coil domain (as indicated 
above, occuring downstream of the second hematopoietic 
domain through the beginning of the transmembrane 
domain) may be deleted (this may be approximately 
15 positions 642 through 839 or 841); (b) the "WSXWS" box 
may be modified by (i) .substitution of the first serine 
with another amino acid, particularly conserved in terms 
of hydrophobicity and/or charge, such as a glycine; (ii) 
the last serine may be substituted with another amino 
20 acid, such as a threonine; (iii) the first tryptophan 

may be substituted with another amino acid, for example, 
a tyrosine. 

Human genomic DNA encoding OB receptor protein 
is also provided herein. The genomic DNA has been 
25 localized to human chromosome IP 31*, which is believed to 
correspond to mouse chromosome 4, the location of the 

mouse db locus. 

Tissue distribution analysis demonstrates the 
presence of OB receptor nucleic acids is fairly 

30 ubiquitous, and particularly noted in the liver. It is 
also observed in the ovary, and heart; and, to a lesser 
extent, in small intestine, lung, skeletal muscle, 
kidney, and, to an even lesser extent, spleen, thymus, 
prostate, testes, placenta and pancreas (Example 2, 

35 below) . There may also be one or more forms of the OB 
receptor present in serum, such as soluble OB receptor, 



WO 97/25424 



PCT/US97/00128 



- 9 - 

which may be complexed to one or more forms of the OB 
protein, 

Amino Acid Sequences and Compositions 
5 According to the present invention, novel OB 

protein receptors and DNA sequences coding for all or 
part of such OB receptors are provided. The present 
invention provides purified and isolated polypeptide 
products having part or all of the primary structural 

10 conformation (i.e., continuous sequence of amino acid 
residues) and one or more of the biological properties 
(e.g., immunological properties and in vitro biological 
activity) and physical properties (e,g., molecular 
weight) of naturally-occurring mammalian OB receptor 

15 including allelic variants thereof. The term "purified 
and isolated" herein means substantially free of 
unwanted substances so that the present polypeptides are 
useful for an intended purpose. For example, one may 
have a recombinant human OB receptor substantially free 

20 of human proteins or pathological agents. These 

polypeptides are also characterized by being a product 
of mammalian cells, or the product of chemical synthetic 
procedures or of procaryotic or eucaryotic host 
expression (e.g., by bacterial, yeast, higher plant, 

25 insect and mammalian cells in culture) of exogenous DNA 
sequences obtained by genomic or cDNA cloning or by gene 
synthesis. The products of expression in typical yeast 
(e.g., Saccharomvces cerevisiae) , insect, or procaryote 
(e.g., £. coli ) host cells are free of association with 

30 any mammalian proteins. The products of expression in 
vertebrate (e.g., non-human mammalian (e.g. COS or CHO) 
and avian) cells are free of association with any human 
proteins. Depending upon the host employed, and other 
factors, polypeptides of the invention may be 

35 glycosylated with mammalian or other eucaryotic 

carbohydrates or may be non-glycosylated. One may modify 
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10 



the nucleic acid so that glycosylation sites are 
included in the resultant polypeptide. One may choose to 
partially or fully deglycosylate a glycosylated 
polypeptide. Polypeptides of the invention may also 
include an initial methionine amino acid residue (at 
position -1 with respect to the first amino acid residue 
of the mature polypeptide) . 

In addition to naturally-occurring allelic 
forms of OB receptor, the present invention also 
embraces other OB receptor products such as polypeptide 
analogs of OB receptor and fragments of OB receptor. 
Following the procedures of the above noted published 
application by Alton et al. (WO 83/04053), one can 
readily design and manufacture genes coding for 
15 microbial expression of polypeptides having primary 

conformations which differ from that herein specified 
for in terms of the identity or location of one or more 
residues (e.g., substitutions, terminal and intermediate 
additions and deletions) . Alternately, modifications of 
20 cDNA and genomic genes may be readily accomplished by 
well-known site-directed mutagenesis techniques and 
employed to generate analogs and derivatives of 03 
receptor. Such products would share at least one of the 
biological properties of mammalian OB receptor but may 
25 differ in others. As examples, projected products of 
the invention include those which are foreshortened by 
e.g., deletions; or those which are more stable to 
hydrolysis (and, therefore, may have more pronounced or 
longer lasting effects than naturally-occurring); or 
30 which have been altered to delete one or more potential 
sites for glycosylation (which may result in higher 
activities for yeast -produced products) ; or which have 
one or more cysteine residues deleted or replaced by, 
e.g., alanine or serine residues and are potentially 
35 more easily isolated in active form from microbial 

systems; or which have one or more tyrosine residues 
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replaced by phenylalanine; or have an altered lysine 
composition (such as those prepared for purposes of 
derivatization) . Included are those polypeptides with 
amino acid substitutions which are ^conservative" 
5 according to acidity, charge, hydrophobicity, polarity, 
size or any other characteristic known to those skilled 
in the art. See generally , Creighton, Proteins, W.H. 
Freeman and Company, N. Y . , (1984) 498 pp. plus index, 
passim . One may make changes in selected amino acids so 

10 long as such changes preserve the overall folding or 
activity of the protein, (see Table 1, below) . Small 
amino terminal extensions, such as an amino-terminal 
methionine residue, a small linker peptide of up to 
about 20-25 residues, or a small extension that facili- 

15 tates purification, such as a poly-histidine tract, an 
antigenic epitope or a binding domain, may also be 
present. See , in general Ford et al.. Protein Expression 
and Purification 2: 95-107, 1991, which is herein 
incorporated by reference. 
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Table 1 

Pnnsprva tivp Am inn Rfiri Snhsf. i tut ions 



Basic: 


arginine 

lysine 
histidine 


Acidic: 


glutamic acid 
aspartic acid 


Polar: 


glutamine 
asparaqine 


Hydrophobic: 


leucine 
isoleucine 
valine 


Aromatic: 


phenylalanine 
tryptophan 
tyrosine 


Small : 


glycine 
alanine 
serine 
threonine 
1 methionine 



5 Also comprehended are polypeptide fragments 

duplicating only a part of the continuous amino acid 
sequence or secondary conformations within OB receptor, 
which fragments may possess one activity of mammalian 
(particularly human) OB receptor (e.g., immunological 

10 activity) and not others (e.g., OB protein binding 
activity) . 

Of applicability to OB receptor fragments and 
polypeptide analogs of the invention are reports of the 
immunological activity of synthetic peptides which 
15 substantially duplicate the amino acid sequence extant 
in naturally-occurring proteins, glycoproteins and 
nucleoproteins. More specifically, relatively low 
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molecular weight polypeptides have been shown to 
participate in immune reactions which are similar in 
duration and extent to the immune reactions of 
physiologically significant proteins such as viral 
5 antigens, polypeptide hormones, and the like. Included 
among the immune reactions of such polypeptides is the 
provocation of the formation of specific antibodies in 
immunologically active animals. See , e.g.. Lerner et 
.al., Cell 21: 309-310 (1891); Ross et al., Nature 22A: 

10 654-656 (1891); Walter et al., PNAS-USA 22: 5197-5200 
(1980); Lerner et al . , PNAS-USA, 21: 3403-3407 (1891); 
Walter et al., PNAS-USA 2&: 4882-4886 (1891); Wong et 
al., PNAS-USA 22: 5322-5326 (1982); Baron et al . , Cell 
23.: 395-404 (1982); Dressman et al . , Nature 295 : 185-160 

15 (1982); and Lerner, Scientific American 66-74 

(1983) . See , ftlso . Kaiser et al. Science 223 : 249-255 

(1984) relating to biological and immunological 
activities of synthetic peptides which approximately 
share secondary structures of peptide hormones but may 

20 not share their primary structural conformation. The 

present invention also includes that class of polypep- 
tides coded for by portions of the DNA complementary to 
the protein-coding strand of the human cDNA or genomic 
DNA sequences of OB receptor i.e., "complementary 

25 inverted proteins" as described by Tramontano et al. 

Nucleic Acid Res. 12.: 504 9-5059 (1984) . Polypeptides or 
analogs thereof may also contain one or more amino acid 
analogs, such as peptidomimetics . 

Thus, the present class of OB receptor 

30 proteins includes those having amino acids (according to 
Seq. ID No. 1) : 

(a) 1-896; 

(b) 22-896; 

(c) 23-896; 
35 (d) 29-896 

(e) 1-839; 
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10 



15 



20 



25 



30 



35 



(f) 22-839; 

(g) 29-839; 

(h) 1-841; 

(i) 22-841; 
(j) 23-841; 
(k) 29-841; 
(1) 1-891; 
(m) 22-891; 
(n) 23-891, 
(o) 29-891, 
(p) the amino acids of subparts (1) 

through (o) having the C-terminal amino acid sequence 
beginning at position 892 of OB receptor B (Seq. ID 
No- 3) or C <Seq. ID, No. 5) ; 

(q) amino acids of subparts b, c, d # f, 
g, i, j, k, m, n, o, and any of (p) lacking a leader 

sequence, which have an N-terminal methionyl residue. 

Also provided is a longer form of an OB 

receptor protein, herein denominated the "D" form, which 

has an amino acid sequence selected from among 

(according to Seq. ID No, 7) : 

(a) amino acids 1-1165; 

(b) amino acids 22-1165, 

(c) amino acids 23-1165; 

(d) amino acids 29-1165; 

(e) amino acids of subparts (b) , (c) or 
(d) having an N-terminal methionyl residue. 

"As set forth above, one~nnmy--^repa^e^soluble 
receptor by elimination of the transmembrane and intra- 
cellular regions. Examples of soluble receptors include 
those set forth in Seq. ID Nos . 10 and 13. What is 
thought to be a native, secreted form of a soluble human 
OB receptor is also provided herein. This form of OB 
receptor protein has an amino acid sequence selected 
from among (according to Seq. ID No. 13) 

(a) amino acids 1-804; 
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(b) amino acids 22-804; 

(c) amino acids 23-804; 

(d) amino acids 29-804; and, 

(e) amino acids of subparts (b) , (c) or 
5 (d) having an N-terminal methionyl residue. 

In addition, since the C-terminus region of 
the above polyeptides diverges at position 892 (with 
respect to Seq. ID Nos. 1, 3, 5, 7 and 13) one may 
desire to prepare only the polypeptides which are 
10 divergent: 

(a) those having only amino acids 892-896 
of Seq. ID No. 1; 

(b) those having only amino acids 8 92-904 
of Seq. ID No. 3; 

15 (c) those having only amino acids 892-958 

of Seq. ID No. 5; 

(d) those having only amino acids 892- 
1165 of Seq. ID No. 7; and, 

(e) those having only amino acids 799-804 
20 of Seq. ID No. 13. 

The above polypeptides which have an 
extracellular domain may be modified, as indicated 
above, and still retain the function of ligand binding. 
Such modification may include one or more of the 
25 following: 

(a) the random coil domain (as indicated 
above, occur ing downstream of the second hematopoietic 
domain through the beginning of the transmembrane 
domain) may be deleted (this may be approximately 

30 positions 642 through 839 or 841); 

(b) the "WSXWS" box may be modified by 
(i) substitution of the first serine with another amino 
acid, particularly conserved in terms of hydrophobicity 
and/or charge, such as a glycine; (ii) the last serine 

35 may be substituted with another amino acid, such as a 
threonine; (iii) the first tryptophan may be 
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substituted with another amino acid, for example, a 
tyrosine . 

Thus, the present polypeptides include 
(according to the numbering of Seq. ID No. 7) : 
5 (a) 1-896; 

(b) 22-896; 

(c) 23-896; 

(d) 29-896 

(e) 1-839; 
10 <f> 22-839; 

(g) 29-839; 

(h) 1-841; 

(i) 22-841; 
(j) 23-841; 

15 (k) 29-841; 

(1) 1-891; 
(m) 22-891, 
(n) 23-891, 
(o) 29-891, 

20 lp) the amino acids of subparts (1) 

through (o) haying the C-terminal amino acids selected 
from the C-terminal amino acids of OB receptor B (Seq. 
ID No. 3), C (Seq. ID. No. 5) and D (Seq ID No. 7); 

(q) the amino acids (according to Seq. ID 
25 No. 13) selected from the group consisting of 22-804; 
23-804 and 29-804; 

(r) amino acids of subparts b, c, d, f, 
g, i, j, k, m, n, o, any of (p) lacking a leader 
sequence, and (q) which have an N-terminal methionyl 
30 residue; and 

(s) amino acids of subparts (a) through 
(r) which above having at least one of the following 
modifications: 

(i) for amino acids of subparts (a) 
35 through (p) and those of subpart (r) which are not amino 
acids according to subpart (q) , deletion of (or 
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substitution of amino acid(s) or other modifications of) 
a random coil domain sequence selected from 

(a) 640 through 839 (using 
the numbering according to Seq. ID No. 1); 

5 (b) 641 through 839; 

(c) 642 through 839; 

(d) 640 through 841; 

(e) 641 through 841; and 

(f) 642 through 841; 

10 (ii) for amino acids of subpart (q) 

and those of subpart (r) which contain the sequence of 
subpart (q) , deletion of of (or substitution of amino 
acid.(s) or other modifications of) a random coil domain 
sequence selected from among: 

15 (a) 640 through 804; 

(b) 641 through 804; and, 

(c) 642 through 804; 

and, 

(iii) modification of a "WSXWS" 

20 sequence which is 

(a) substitution of the first 
serine with another amino acid, particularly conserved 
in terms of hydrophobicity and/or charge, such as a 
glycine; 

25 (b) substition of the last 

serine with another amino acid, such as a threonine; 
and 

(c) substitution of the first 
tryptophan with another amino acid, for example, a 
30 tyrosine. 

One may modify the OB receptor to create a 
fusion molecule with other peptide sequence . For 
example, if one desired to "tag" the OB receptor with an 
immunogenic peptide, one could construct a DNA which 
35 would result in such fusion protein. The tag may be at 
the N-terminus. Also, since it is apparent that the 
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C-terminus is not necessary for ligand binding activity, 
one may chemically modify the C-terminus of, for 
example, a soluble OB receptor. One may desire, for 
example, a preparation whereby one or more polymer 
5 molecules such as polyethylene glycol molecules are 

attached. Thus, another aspect of the present invention 
is chemically modified OB receptor protein (also further 
described infra) • 

An example of such "tag" is provided herein 
10 using the C-terminus of a recombinant soluble OB 

receptor. Seq. ID No. 12 provides a "FLAG-tag" v ersion 
of such soluble OB receptor (the nucleic acid sequence 
is provided, which may be' transcribed to prepare the 
polypeptide) . Such "FLAG-tag" may also be attached to 
15 the N-terminus or other region of an OB receptor 

protein. This type of "tagging" is useful to bind the 
protein using reagents, such as antibodies, which are 
selective for such tag. Such binding may be for 
detection of the location or amount of protein, or for 
20 protein capturing processes where, for example, an 

affinity column is used to bind the tag, and thus the 
desired protein. Other types of detectable labels, such 
as radioisotopes, light-emitting (e.g-, fluorescent or 
phosporescent compounds) , enzymatically cleavable, 
25 detectable antibody (or modification thereof) , or other 
substances may be used for such labelling of the present 
proteins. Detecting protein via use of the labels may 
be useful for identifying the presence or amount of OB 
recepto r protein or a compound containin g such protein 
30 (e.g< ^OB_protein complexed to OB recepto rp Moreover, 
such labelled protein may be useful for distinguishing 
exogenous OB receptor protein from the endogenous form. 
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Nucleic Acids 

Novel nucleic acid sequences of the invention 
include sequences useful in securing expression in 
5 procaryotic or eucaryotic host cells of polypeptide 
products having at least a part of the primary struc- 
tural conformation and one or more of the biological 
properties of recombinant human OB receptor. The 
. nucleic acids may be purified and isolated, so that the 

10 desired coding region is useful to produce the present 
polypeptides, for example, or for diagnostic purposes, 
as described more fully below. DNA- sequences of the 
invention specifically comprise: (a) any of the DNA 
sequences set forth in Seq. ID No. 2, 4, 6, 8, 9, 11, 

15 12, and 14 <and complementary strands) ; (b) a DNA 
sequence which hybridizes (under hybridization 
conditions disclosed in the cDNA library screening 
section below, using the 300 bp PCR fragment as 
described to selectively hybridize to a cDNA encoding an 

20 OB receptor protein in a human liver cDNA library, or 
equivalent conditions or more stringent conditions) to 
the DNA sequence in subpart (a) or to fragments thereof; 
and (c) a DNA sequence which, but for the degeneracy of 
the genetic code, would hybridize to the DNA sequence in 

25 subpart (a) . Specifically comprehended in parts (b) and 
(c) are genomic DNA sequences encoding allelic variant 
forms of human OB receptor and/or encoding OB receptor 
from other mammalian species, and manufactured DNA 
sequences encoding OB receptor, fragments of OB 

30 receptor, and analogs of OB receptor which DNA sequences 
may incorporate codons facilitating transcription and 
translation of messenger RNA in microbial hosts. Such 
manufactured sequences may readily be constructed 
according to the methods of Alton et al., PCT published 

35 application WO 83/04053, 
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Genomic DNA, such as that of Seq. ID No. 9, 
encoding the present OB receptors may contain additional 
non-coding bases, or introns, and such genomic DNAs are 
obtainable by hybridizing all or part of the cDNA, 
*5 illustrated in Seq. ID Nos. 2, 4, 6, 8, 11, and 14 to a 
genomic DNA source, such as a human genomic DNA library. 
- Such genomic DNA will encode functional OB receptor 
polypeptide; however, use of the cDNAs may be more 
practicable in that, since only the coding region is 
10 involved, recombinant manipulation is facilitated. The 

intron/exon location of genomic DNA is set forth in Seq. 
ID No. 9, infra. 

Nucleic acid sequences include the 
incorporation of codons which enhance expression by 
15 selected nonmammalian hosts; the provision of sites for 
cleavage by restriction endonuclease enzymes; and the 
provision of additional initial, terminal or 
intermediate DNA sequences which facilitate construction 
of cloning and/or expression vectors, 
20 The present invention also provides DNA 

sequences coding for polypeptide analogs or derivatives 
of OB receptor which differ from naturally-occurring 
forms in terms as described above. The leader sequence 
DNA may be substituted with another leader sequence for 
25 ease in expression or for other purposes. 

Also, one may prepare antisense nucleic acids 
against the present DNAs. Such antisense nucleic acids 
may be useful in modulating the effects of OB receptor 
protein An vivo . For example, one may prepare an 
30 antisense nucleic acid which effectively disables the 
ability of a cell to produce OB receptor by binding to 
the nucleic acid which encodes such OB receptor. 

DNA sequences of the invention are also 
suitable materials for use as labeled probes in 
35 isolating human genomic DNA encoding OB receptor, as 
mentioned above, and related proteins as well as cDNA 
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and genomic DNA sequences of other mammalian species. 
DNA sequences may also be useful in various alternative 
methods of protein synthesis (e.g., in insect cells) or, 
as described infra , in genetic therapy in humans and 
5 other mammals. DNA sequences of the invention are 

expected to be useful in developing transgenic mammalian 
species which may serve as eucaryotic "hosts" for 
production of OB receptor and OB receptor products in 
quantity. See, generally, Palmiter et al . , Science 222 : 
10 809-814 (1983) . 

Vectors and Host Cells 

According to another aspect of the present 
invention, the DNA sequences described herein which 

15 encode OB receptor polypeptides are valuable for the 

information which they provide concerning the amino acid 
sequence of the mammalian protein which have heretofore 
been unavailable. Put another way, DNA sequences 
provided by the invention are useful in generating new 

20 and useful viral and circular plasmid DNA vectors, new 
and useful transformed and transfected procaryotic and 
eucaryotic host cells (including bacterial cells, yeast 
cells, insect cells, and mammalian cells grown in 
culture) , and new and useful methods for cultured growth 

25 of such host cells capable of expression of OB receptor 
and its related products. 

The DNA provided herein (or corresponding 
RNAs) may also be used for gene therapy for, example, 
treatment of conditions characterized by the 

30 overexpression of OB protein, such as anorexia or 

cachexia. Alternatively, gene therapy may be used in 
cases where increased sensitivity to OB protein is 
desired, such as in cases where an individual has a 
condition characterized by OB protein receptors 

35 defective in ability to bind or retain the binding of OB 
protein. Currently, vectors suitable for gene therapy 
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(such as retroviral or adenoviral vectors modified for 
gene therapy purposes and of purity and pharmaceutical 
acceptability) may be administered for delivery into the 
lung, for example. Such vectors may incorporate nucleic 
5 acid encoding the present polypeptides for expression in 
a desired location. Gene therapy may involve more than 
one gene for a desired protein or different desired 
proteins. 

Alternatively, one may use no vector so as to 
10 facilitate relatively stable presence in the host. For 
example, homologous recombination of a DNA as provided 
herein or of a suitable transcription or translation 
control region may facilitate integration into or 
expression from a- host genome. (This may be performed 
15 for production purposes as well, e.g., U.S. Patent 

No. 5,272,071 and WO 91/09955.) The nucleic acid may be 
placed within a pharmaceutical^ acceptable carrier to 
facilitate cellular uptake, such as a lipid solution 
carrier (e.g., a charged lipid), a liposome, or 
20 polypeptide carrier (e.g., polylysine) . A review 

article on gene therapy is Verma, Scientific American, 
November 1990, pages 68-84 which is herein incorporated 
by reference . 

Thus, the present invention provides for a 
25 population of cells expressing an OB receptor of the 

present OB receptor family. Such cells are suitable for 
transplantation or implantation into an individual for 
therapeutic purposes. For example, one may prepare a 
population of cells to overexpress OB receptor (such as 
30 one identified in the Sequence ID'S or otherwise denoted 
herein), or to express a desired form of OB receptor, 
such as one which is particularly sensitive to OB 
protein (i.e., a form which has a desired capacity for 
signal transduction) . One may then implant such cells 
35 into an individual to increase that individual's 

sensitivity to OB protein. Such cells may, for example, 
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be liver cells, bone marrow cells, or cells derived from 
umbillical cord. Alternatively, one may wish to use 
overexpressing circulating cells such as blood 
progenitor cells, T cells or other blood cells. For 
5 humans, human cells may be used. Cells may be in the 
form of tissue. Such cells may be cultured prior to 
transplantation or implantation. Such OB receptor 
overexpression, or expression of particularly sensitive 
forms of OB receptor may be accomplished by, for 

10 example, altering the regulatory mechanism for 

expression of OB receptor, such as using homologous 
recombination techniques as described supra . Thus, 
provided is a population of host cells modified so that 
expression of endogenous OB receptor DNA is enhanced. 

15 The cells to be transferred to the recipient 

may be cultured using one or more factors affecting the 
growth or proliferation of such cells if appropriate. 
Hematopoietic factors may be used in culturing 
hematopoietic cells. Such factors include G-CSF, EPO, 

20 MGDF, SGF, Flt-3 ligand, inter leukins (e.g., IL1-IL13) , 
GM-CSF, LIF, and analogs and derivatives thereof as 
available to one skilled in the art. 

Nerve cells, such as neurons or glia, may also 
be used, and these may be cultured with neurotrophic 

25 factors such as BDNF, CNTF, GDNF, NT3, or others. 

There may be a co-gene therapy involving the 
transplantation of cells expressing more than one 
desired protein. For example, cells expressing OB 
receptor protein may be used in conjunction, 

30 simultaneously or in serriatim with cells expressing OB 
protein. 

For gene therapy dosages, one will generally 
use between one copy and several thousand copies of the 
present nucleic acid per cell, depending on the. vector, 
35 the expression system, the age, weight and condition of 
the recipient and other factors which will be apparent 
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to those skilled in the art. The cellular delivery of 
such protein may be designed to last for a selected 
period of time, such as a period of days, weeks, months 
or years. At the end of the effective time period, the 
5 recipient of such transformed cells may receive another 
"dose" (e.g., transplantation of cells). Cells may be 
selected for their lifespan, their time period of 
expression of the desired protein, or their ability to 
be reisolated from an individual (i.e., for blood cells, 
10 leukaphoresis may be used to retrieve transformed cells 
using markers present on the cell surface) . Vectors may 
be similiarly designed using, for example, viruses which 
have a known period of expression of DNAs contained 
therein. 

15 The desired cells or vectors may be stored 

using techniques, such as freezing, available to those 
in the art. 

Thus, the present invention also contemplates 
a method for administering OB receptor protein to an 
20 individual, wherein the source of said OB receptor 
protein is selected from (i) a population of cells 
expressing OB receptor protein and (ii) a population of 
vectors expressing OB receptor protein. Said OB 
receptor protein may be selected from among those 
25 described herein. Said vectors may be virus vectors 
capable of infecting human cells. Said cells may be 
selected from among tissue or individual cells. Said 
individual cells may be selected from among adipocytes, 
fibroblasts, bone marrow cells, peripheral blood 
30 progenitor cells, red blood cells, and white blood 
cells, including T cells and nerve cells. Said 
population of cells or vectors may be co-administered 
with a population of cells or vectors which express OB 
protein or another desired protein. Said cells or 
35 vectors may be stored for use in an individual. Storage 
may be by freezing 
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In addition to the OB receptor protein as 
described herein, one may prepare complexes of OB 
5 receptor protein and OB protein, analog or derivative. 
The OB protein may be selected from those 
described in PCT publication WO 96/05309, above and 
hereby incorporated by reference in its entirety. 
Figure 3 of that publication (Seq. ID No, 4, as cited 

10 therein) depicts the full deduced amino acid sequence 
derived for the human OB gene. The amino acids are 
numbered from 1 to 167. A signal sequence cleavage site 
is located after amino acid 21 (Ala) so that the mature 
protein extends from amino acid 22 (Val) to amino acid 

15 167 (Cys) . For the present disclosure, a different 

numbering is used herein, where the amino acid position 
1 is the Valine residue which is at the beginning of the 
mature protein . 

Generally, the OB protein for use will be 

20 capable of complexing to the OB protein receptor 

selected. Thus, one may empirically test the binding 
capability (to all or part of the extracellular domain 
of the OB receptor as indicated above) to determine 
which OB protein forms may be used. Generally, 

25 modifications generally applicable as indicated above 
for OB receptor protein may also be applied here, and 
that disclosure is incorporated by reference here. As 
set forth in WO 96 05309, OB protein in its native form, 
or fragments (such as enzyme cleavage products) or other 

30 truncated forms, analogs, and derivatives all retain 

biological activity. Such forms may be used so long as 
the form binds to at least a portion of the 
extracellular domain of the present OB receptor 
proteins . 

35 An effective amount of an OB protein, analog 

. or derivative thereof may be selected from among 
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according to the amino acid sequence as presented in PCT 
WO 96/05309, Figure 3 numbered so that the first amino 
acid of the mature protein is number 1: 

(a) the amino acid sequence 1-146, 

5 optionally lacking a glutaminyl residue at position 28, 
and further optionally having a methionyl residue at the 
N-terminus; 

(b) an amino acid sequence of subpart 
(a) having a different amino acid substituted in one or 

10 more of the following positions: 4, 8, 32, 33, 35, 48, 
50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 78, 89, 97, 
100, 102, 105, 106, 107, 108, 111, 112, 118, 136, 138, 
142, and 145; 

(c) a truncated OB protein analog 

15 selected from among: (using the numbering of subpart (a) 
above) : 

(i) amino acids 98-146 
<ii) amino acids 1-32 
(iii) amino acids 1-35 
20 (iv) amino acids 40-116 

(v) amino acids 1-99 and 112-146 
(vi) amino acids 1-99 and 112-146 
having one or more of amino acids 100-111 
sequentially placed between amino acids 99 and 112; 
25 and, 

(vii) the truncated OB analog of 
subpart (i) having one or more of amino acids 100, 
102, 105, 106, 107, 108, 111, 112, 118, 136, 138, 
142, and 145 substituted with another amino acid; 
30 (viii) the truncated analog of subpart 

(ii) having one or more of amino acids 4, 8 and 32 
substituted with another amino acid; 

(ix) the truncated analog of subpart 
(iv) having one or more of amino acids 50, 53, 60, 
35 64, 66, 67, 68, 71, 74, 77, 78, 89, 97, 100, 102, 
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105, 106, 107, 108, 111 and 112 replaced with 
another amino acid; 

(x) the truncated analog of subpart 

(v) having one or more of amino acids 4, 8, 32, 33, 
5 35, 48, 50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 

78, 89, 97, 112, 118, 136, 138, 142, and 145 
replaced with another amino acid; 

(xi) the truncated analog of subpart 

(vi) having one or more of amino acids 4, 8,32, 33, 
10 35, 48, 50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 

78, 89, 97, 100, 102, 105, 106, 107, 108, 111, 
112, 118, 136, 138, 142, and 145 replaced with 
another amino acid; . 

(xii) the truncated analog of any of 
15 subparts (i)-(xi) having an N-terminal methionyl 

residue; and 

(d) the OB protein or analog derivative 
of any of subparts (a) through (c) comprised of a 
chemical moiety connected to the protein moiety; 
20 (e) a derivative of subpart (d) wherein 

said chemical moiety is a water soluble polymer moiety; 

(f) a derivative of subpart (e) wherein 
said water soluble polymer moiety is polyethylene 
glycol; 

25 (g) A derivative of subpart (f) wherein 

said water soluble polymer moiety is a polyamino acid 
moiety; 

(h) a derivative of subpart (g) wherein 
said water soluble polymer moiety is attached at solely 

30 the N-terminus of said protein moiety; 

(i) an OB protein, analog or derivative 
of any of subparts (a) through (h) in a pharmaceutically 
acceptable carrier. 

OB proteins, analogs and related molecules are 
35 also reported in the following publications; however, no 
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representation is made with regard to the activity of 
any composition reported: 

U.S. Patent Nos. 5,521,283; 5,532,336; 
5,552,522; 5,552,523; 5,552,524; 5,554,727; 
5 5,559,208; 5,563,243; 5,563,244; 5,563,245; 

5,567,678; 5,567,803; 5,569,744; 5,569,743 

(all assigned to Eli Lilly and Company) ; 
PCT W096/23517; W096/23515; W096/23514; 
WO96/24670; W096/23513; W096/23516; 
10 W096/23518; W096/23519; WO96/23520; 

W096/23815; WO96/24670; W096/27385 (all 

assigned to Eli Lilly and Company); 
PCT WO96/22308 (assigned to Zymogenet ics) ; 
PCT WO96/29405 (assigned to Ligand 
15 Pharmaceuticals, Inc.); 

PCT W096/31526 (assigned to Amyin 

Pharmaceuticals, Inc. ) ; 
PCT W096/34885 (assigned to Smithkline Beecham 
PLC) ; 

20 PCT W096/35787 (assigned to Chiron) ; 

EP 0 725 079 (assigned to Eli Lilly and 
Company) ; 

EP 0 725 078 (assigned to Eli Lilly and 
Company) ; 

25 EP 0 736 599 (assigned to Takeda) ; 

EP 0 741 187 (assigned to F. Hoffman LaRoche) . 



To the extent these references provide for 
useful OB proteins or analogs or derivatives thereof, or 

30 associated compositions or methods, such compositions 
and/or methods may be used in conjunction with the 
present OB receptor proteins, such as for co- 
administration (together or separately, in a selected 
dosage schedule) or by complexing compositions to the 

35 present OB protein receptors. With the above provisos, 
these publications are herein incorporated by reference. 
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Derivative and Formulae ions 

The present OB protein receptor and/or OB 
protein (herein the term "protein" is used to include 
5 "peptide" and OB protein or receptor analogs, such as 
those recited infra , unless otherwise indicated) may 
also be derivatized by the attachment of one or more 
chemical moieties to the protein moiety. If the present 
pharmaceutical compositions contain as the active 

10 ingredient a complex of OB protein receptor and OB 
protein, one or both of such proteins may be 
derivatized. The chemically modified derivatives may be 
further formulated for intraarterial, intraperitoneal, 
intramuscular, subcutaneous, intravenous, oral, nasal, 

15 pulmonary, topical or other routes of administration. 
Chemical modification of biologically active proteins 
has been found to provide additional advantages under 
certain circumstances, such as increasing the stability 
and circulation time of the therapeutic protein and 

20 decreasing immunogenicity . See U.S. Patent 

No. 4,179,337, Davis et al . , issued December 18, 1979. 
For a review, see Abuchowski et al . , in Enzymes as 
Drugs. (J.S. Holcerberg and J. Roberts, eds . 
pp. 367-383 (1891)). A review article describing 

25 protein modification and fusion proteins is Francis, 

Focus on Growth Factors 2* 4-10 (May 1992) (published by 
Mediscript, Mountview Court, Friern Barnet Lane, London 
N20, OLD, UK) . 

Preferably, for therapeutic use of the 

30 end-product preparation, the chemical moiety for 

derivatization will be pharmaceutical ly acceptable. A 
polymer may be used. One skilled in the art will be 
able to select the desired polymer based on such 
considerations as whether the polymer /protein conjugate 

35 will be used therapeutically, and if so, the desired 

dosage, circulation time, resistance to proteolysis, and 
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other considerations. For the present proteins and 
peptides, the effectiveness of the derivatization may be 
ascertained by administering the derivative, in the 
desired form (i.e., by osmotic pump, or by injection or 
5 infusion, or, further formulated for oral, pulmonary or 
nasal delivery, for example) , and observing biological 
effects as described herein. 

The chemical moieties suitable for 
derivatization may be selected from among various water 
10 soluble polymers- The polymer selected should be water 
soluble so that the protein to which it is attached so 
that it is miscible in an aqueous environment, such as a 
physiological environment. The water soluble polymer 
may be selected from the group consisting of, for 
15 example, polyethylene glycol, copolymers of ethylene 
glycol/propylene glycol, carboxymethylcellulose, 
dextran, polyvinyl alcohol, polyvinyl pyrolidone, 
poly-1, 3-dioxolane, poly-1, 3, 6-trioxane, 
ethylene/maleic anhydride copolymer, polyaminoacids 
20 (either homopolymers or random or non-random copolymers 
( see supra regarding fusion molecules) , and dextran or 
poly(n-vinyl pyrolidone) polyethylene glycol, propylene 
glycol homopolymers, polypropylene oxide/ethylene oxide 
co-polymers, polyoxyethylated polyols, 
25 polystyrenemaleate and polyvinyl alcohol. Polyethylene 
glycol propionaldenhyde may have advantages in 
manufacturing due to its stability in water. 

Fusion proteins may be prepared by attaching 
polyaminoacids to the OB protein receptor or OB protein 
30 (or analog or complex) moiety. For example, the 

polyamino acid may be a carrier protein which serves to 
increase the circulation half life of the protein. For 
the present therapeutic or cosmetic purposes, such 
polyamino acid should be those which do not create 
35 neutralizing antigenic response, or other adverse 

response- Such polyamino acid may be selected from the 
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group consisting of serum album (such as human serum 
albumin), an antibody or portion thereof (such as an 
antibody constant region, sometimes called u Fc n ) or 
other polyamino acids. As indicated below, the location 
5 of attachment of the polyamino. acid may be at the 

N-terminus of the OB protein moiety, or other place, and 
also may be connected by a chemical "linker" moiety to 
the OB protein. 

The polymer may be of any molecular weight, 

10 and may be branched or unbranched. For polyethylene 

glycol, the preferred molecular weight is between about 
2 kDa and about 100 kDa (the term "about" indicating 
that in preparations of polyethylene glycol, some 
molecules will weigh more, some less, than the stated 

15 molecular weight) for ease in handling and 

manufacturing. Other sizes may be used, depending on 
the desired therapeutic profile (e.g., the duration of 
sustained release desired, the effects, if any on 
biological activity, the ease in handling, the degree or 

20 lack of antigenicity and other known effects of the 

polyethylene glycol to a therapeutic protein or analog) . 

The number of polymer molecules so attached 
may vary, and one skilled in the art will be able to 
ascertain the effect on function. One may mono- 

25 derivatize, or may provide for a di-, tri-, tetra- or 
some combination of derivatization, with the same or 
different chemical moieties (e.g., polymers, such as 
different weights of polyethylene glycols) . The 
proportion of polymer molecules to protein (or peptide) 

30 molecules will vary, as will their concentrations in the 
reaction mixture. In general, the optimum ratio (in 
terms of efficiency of reaction in that there is no 
excess unreacted protein or polymer) will be determined 
by factors such as the desired degree of derivatization 

35 (e.g., mono, di-, tri-, etc.), the molecular weight of 
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the polymer selected, whether the polymer is branched or 
unbranched, and the reaction conditions. 

The chemical moieties should be attached to 
the protein with consideration of effects on functional 
5 or antigenic domains of the protein. There are a number 
of attachment methods available to those skilled in the 
art. E.g. , EP 0 401 384 herein incorporated- by reference 
(coupling PEG to G-CSF) , £££ Malik et al., Exp. 

Hematol. 2SL : 1028-1035 (1992) (reporting pegylation of 
10 GM-CSF using tresyl chloride) . For example, 

polyethylene glycol may be covalently bound through 
amino acid residues via a reactive group, such as, a 
free amino or carboxyl group. Reactive groups are those 
to which an activated polyethylene glycol molecule (or 
15 other chemical moiety) may be bound. The amino acid 
residues having a free amino group may include lysine 
residues and the N-terminal amino acid residue. Those 
having a free carboxyl group may include aspartic acid 
residues, glutamic acid residues, and the C-terminal 
20 amino acid residue. Sulfhydrl groups may also be used 
as a reactive group for attaching the polyethylene 
glycol molecule (s) (or other chemical moiety). 
Preferred for therapeutic manufacturing purposes is 
attachment at an amino group, such as attachment at the 
25 N-terminus or lysine group. Attachment at residues 
important for receptor binding should be avoided if 
receptor binding is desired. 

One may specifically desire N-terminally 
chemically modified protein. Using polyethylene glycol 
30 as an illustration of the present compositions, one may 
select from a variety of polyethylene glycol molecules 
(by molecular weight, branching, etc.), the proportion 
of polyethylene glycol molecules to protein molecules in 
the reaction mix, the type of pegylation reaction to be 
35 performed, and the method of obtaining the selected 

N-terminally pegylated protein. The method of obtaining 
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the N-terminally pegylated preparation (i.e., 
separating this moiety from other monopegylated moieties 
if necessary) may be by purification of the N-terminally 
pegylated material from a population of pegylated 
5 protein molecules. Selective N-terminal chemical 

modification may be accomplished by reductive alkyiation 
which exploits differential reactivity of different 
types of primary amino groups (lysine versus the 
N-terminal) available for derivatization in a particular 

10 protein. PCT WO 96/11953/ herein incorporated by 

reference. Under the appropriate reaction conditions, 
substantially selective derivatization of the protein at 
the N-terminus with a carbonyl group containing polymer 
is achieved. For example, one may selectively 

15 N-terminally pegylate the protein by performing the 

reaction at a pH which allows one to take advantage of 
the pK a differences between the e-amino group of the 

lysine residues and that of the a-amino group of the 
N-terminal residue of the protein. By such selective 

20 derivatization, attachment of a polymer to a protein is 
controlled: the conjugation with the polymer takes 
place predominantly at the N-terminus of the protein and 
no significant modification of other reactive groups, 
such as the lysine side chain amino groups, occurs. 

25 Using reductive alkyiation, the polymer may be of the 

type described above, and should have a single reactive 
aldehyde for coupling to the protein. Polyethylene 
glycol propionaldehyde, containing a single reactive 
aldehyde, may be used. 

30 An N-terminally chemically modified derivative 

is preferred (over other forms of chemical modification) 
for ease in production of a therapeutic. N-terminal 
chemical modification ensures a homogenous product as 
characterization of the product is simplified relative 

35 to di-, tri- or other multi-derivatized products. The 
use of the above reductive alkyiation process for 
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preparation of an N-terminally chemically modified 
product, is preferred for ease in commercial 
manufacturing . 

In yet another aspect of the present 
5 invention, provided are methods of using pharmaceutical 
compositions of the proteins, and derivatives. Such 
pharmaceutical compositions may be for administration by 
injection, or for oral, pulmonary, nasal, transdermal or 
other forms of administration. In general, comprehended 
10 by the invention are pharmaceutical compositions 

comprising effective amounts of protein or derivative 
products of the invention together with pharmaceutical^ 
acceptable diluents, preservatives, solubilizers, 
emulsifiers, adjuvants and/or carriers. Such 
15 compositions include diluents of various buffer content 
(e.g., Tris-HCl, acetate, phosphate), pH and ionic 
strength; additives such as detergents and solubilizing 
agents (e.g., Tween 80, Polysorbate 80), anti-oxidants 
(e.g., ascorbic acid, sodium metabisulf ite) , 
20 preservatives (e.g., Thimersol, benzyl alcohol) and 
bulking substances (e.g., lactose, mannitol) ; 
incorporation of the material into particulate 
preparations of polymeric compounds such as polylactic 
acid, polyglycolic acid, etc. or into liposomes. Sfi£, 
25 e.g. , PCT W096/29989, Collins et al., "Stable protein: 
phospholipid compositions and methods, " published 
October 3, 1996, herein incorporated by reference. 
Hylauronic acid may also be used, and this may have the 
effect of promoting sustained duration in the 
30 circulation. Such compositions may influence the 

physical state, stability, rate of in viVQ release, and 
rate of in vivo clearance of the present proteins and 
derivatives. See , e.g . . Remington's Pharmaceutical 
Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, 
35 PA 18042) pages 1435-1712 which are herein incorporated 
by reference. The compositions may be prepared in 
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liquid form, or may be in dried powder, such as 
lyophilized form. Implantable sustained release 
formulations are also contemplated, as are transdermal 
formulations . 

5 Specifically contemplated are oral dosage 

forms of the above derivatized proteins. Protein may be 
chemically modified so that oral delivery of the 
derivative is efficacious. Generally, the chemical 
modification contemplated is the attachment of at least 

10 one moiety to the protein (or peptide) molecule itself, 
where said moiety permits (a) inhibition of proteolysis; 
and (b) uptake into the blood stream from the stomach or 
intestine. Also desired As the increase in overall 
stability of the protein and increase in circulation 

15 time in the body. See PCT W095/21629, Habberfield, 
"Oral Delivery of Chemically Modified Proteins" 
(published August 17, 1995) herein incorporated by 
reference, and U.S. Patent No. 5, 574, 018, Habberfield et 
al., "Conjugates of Vitamin B12 and Proteins," issued 

20 November 12, 1996, herein incorporated by reference. 

Also contemplated herein is pulmonary delivery 
of the present protein, or derivative thereof. The 
protein (derivative) is delivered to the lungs of a 
mammal while inhaling and traverses across the lung 

25 epithelial lining to the blood stream. See, PCT 

WO94/20069, Niven et al., "Pulmonary administration of 
granulocyte colony stimulating factor, " published 
September 15, 1994, herein incorporated by reference. 

Nasal delivery of the protein (or analog or 

30 derivative) is also contemplated. Nasal delivery allows 
the passage of the protein to the blood stream directly 
after administering the therapeutic product to the nose, 
without the necessity for deposition of the product in 
the lung. Formulations for nasal delivery include those 

35 with absorption enhancing agents, such as dextran or 
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cyclodextran. Delivery via transport across other 
mucous membranes is also contemplated. 



Dosages 

5 One skilled in the art will be able to 

ascertain effective dosages by administration and 
observing the desired therapeutic effect. Preferably, 
the formulation of the molecule or complex in a 
pharmaceutical composition will be such that between 
10 about .10 ng/kg/day and 10 mg/kg/day will yield the 

desired therapeutic effect. The effective dosages may 
be determined using diagnostic tools over time. For 
example, a diagnostic for measuring the amount of OB 
protein or OB receptor protein in the blood (or plasma 
15 or serum) may first be used to determine endogenous 
levels of OB protein (or receptor) . Such diagnostic 
tool may be in the form of an antibody assay, such as an 
antibody sandwich assay. The amount of endogenous OB 
receptor protein (such as soluble receptor) is 
20 quantified initially, and a baseline is determined. The 
therapeutic dosages are determined as the quantification 
of endogenous and exogenous OB receptor protein (that 
is, protein, analog or derivative found within the body, 
either self-produced or administered) is continued over 
25 the course of therapy. The dosages may therefore vary 
over the course of therapy, with a relatively high 
dosage being used initially, until therapeutic benefit 
is seen, and lower dosages used to maintain the 
therapeutic benefits. 
30 During an initial course of therapy of an 

obese person, dosages may be administered whereby weight 
loss and concomitant fat tissue decrease increase is 
achieved. Once sufficient weight- loss is achieved, a 
dosage sufficient to prevent re-gaining weight, yet 
35 sufficient to maintain desired weight or fat mass may be 
administered. These dosages can be determined 
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empirically, as the effects of OB protein are 
reversible. E.g. , Campfield et al . , Science 26? : 546-549 
(1995) at 547 # Thus, if a dosage resulting in weight 
loss is observed when weight loss is not desired, one 
5 would administer a lower dose, yet maintain the desired 
weight . 

Therapeutic Compositions and Methods 

The present OB receptor proteins, alone, or in 

10 combination with an OB protein, and nucleic acids may be 
used for methods of treatment, or for methods of 
manufacturing medicaments for treatment. Such treatment 
includes conditions characterized by excessive 
production of OB protein, wherein the present OB 

15 receptors, particularly in soluble form, may be used to 
complex to and therefore inactivate such excessive OB 
protein. Or, such OB receptor protein, particularly in 
soluble form, may act to protect the activity of OB 
protein. While not wishing to be bound by theory, one 

20 may postulate that OB protein receptor agonist activity 
may be accomplished by a protective effect achieved when 
OB protein receptor (particularly soluble receptor) is 
complexed to OB protein. Such effect may prolong the 
serum half life of OB protein ia vivo . Such treatments 

25 may be accomplished by preparing soluble receptor (e.g., 
use of an extracellular domain as described supra ) and 
administering such composition to an individual in need 
thereof or by preparation of a population of cells 
containing or expressing such OB receptor, and 

30 transplanting such cells into the individual in need 
thereof. 

The present OB receptors may also be used for 
treatment of those having defective OB receptors. For 
example, one may treat an individual having defective OB 
35 receptors by preparation of a population of cells 
containing such non-defective OB receptor, and 
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transplanting such cells into an individual- Or, an 
individual may have an inadequate number of OB 
receptors, and cells containing such receptors may be 
transplanted in order to increase the number of OB 
* 5 receptors available to an individual. 

The present OB receptor proteins and related 
• compositions such as OB receptor protein/OB protein 
complex, provide for weight loss, fat loss, increase in 
lean mass, increase in insulin sensitivity, increase in 
10 overall strength, increase in red blood cells (and 

oxygenation in the blood) , decrease in bone resportion 
or osteoporosis, decreased or maintained serum 
cholesterol level, decreased or maintained triglyceride 
<LDL or VLDL) levels, prevention or reduction in 
15 arterial plaque formation, treatment of hypertension, 

and prevention or reduction of gall stone formation. As 
body fat composition may be correlated with certain 
types of cancers, the present compositions may be useful 
for the prevention or amelioration of certain types of 
20 cancers. The present invention also includes methods 
for manufacture of a medicament for use in conjunction 
with the cosmetic/therapeutic conditions described 
herein, containing at least one of the present 
compositions . 

25 The present compositions and methods may be 

used in conjunction with other medicaments, such as 
those useful for the treatment of diabetes (e.g., 
insulin or analogs thereof, thiazolidinediones or other 
antihyperglycemic agents, and possibly amylin or 

30 antagonists there of) , cholesterol and blood pressure 
lowering medicaments (such as those which reduce blood 
lipid levels or other cardiovascular medicaments), and 
activity increasing medicaments (e.g., amphetamines). 
Appetite suppressants may also be used (such as 

35 serotonin modulators and neuropeptide Y antagonists) . 
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Such administration may be simultaneous or may be in 

seriatim- 

In addition, the present methods may be used 
in conjunction with surgical procedures, such as 
5 cosmetic surgeries designed to alter the overall 
appearance of a body (e.g., liposuction or laser 
surgeries designed to reduce body mass, or implant 
surgeries designed to increase the appearance of body 
mass) . The health benefits of cardiac surgeries, such 

10 as bypass surgeries or other surgeries designed to 

relieve a deleterious condition caused by blockage of 
blood vessels by fatty deposits, such as arterial 
plaque, may be increased -with concomitant use of the 
present compositions and methods. Methods to eliminate 

15 gall stones, such as ultrasonic or laser methods, may 
also be used either prior to, during or after a course 
of the present therapeutic methods. Furthermore, the 
present methods may be used as an adjunct to surgeries 
or therapies for broken bones, damaged muscle, or other 

20 therapies which would be improved by an increase in lean 
tissue mass . 

In yet another aspect, the present invention 
provides for methods of manufacture of a medicament for 
the treatment of obesity, type II diabetes, excess blood 

25 lipid, or cholesterol levels, increasing sensitivity to 
insulin, increasing lean mass, and other conditions as 
set forth above. Also provided are solely cosmetic 
treatments for individuals wishing to improve appearance 
by weight loss, and more specifically, loss of fat 

30 deposits, even in the absence of any therapeutic 
benefit . 

Diagnostic Compositions and Methods 

As indicated supra . polypeptide products of 
35 the invention may be "labeled" by association with a 
detectable marker substance (e.g., radiolabeled with 
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10 



125 I# fluorescent, chemiluminescent , enzyme) to provide 
reagents useful in detection and quantification of OB 
receptor (or complexes) in solid tissue and fluid 
samples such as blood or urine. Nucleic acid products 
of the invention may also be labeled with detectable 
markers (such as radiolabels and non-isotopic labels 
such as biotin) and employed in hybridization processes 
to locate the human OB receptor gene position and/or the 
position of any related gene family in a chromosomal 
map. Nucleic acid sequences which selectively bind the 
human OB receptor gene are useful for this purpose. 
They may also be used for identifying human OB receptor 
gene disorders at the DNA level and used as gene markers 
for identifying neighboring genes and their disorders. 
15 Such nucleic acid sequences may be sued for detection or 
measurement of OB receptor mRNA level from a biological 
sample. Contemplated herein are kits containing such 
labelled materials. 

The protein and/or nucleic acids provided 
20 herein may also be embodied as part of a kit or article 
of manufacture-. Contemplated is an article of 
manufacture comprising a packaging material and one or 
more preparations of the presently provided 
compositions. Such packaging material will comprise a 
25 label indicating that the protein or nucleic acid 

preparation is useful for detecting and/or quantifying 
the amount of OB receptor in a biological sample, or OB 
receptor defects in a biological sample. As such, the 
kit may optionally include materials to carry out such 
30 testing, such as reagents useful for performing DNA or 
RNA hybridization analysis, or PCR analysis on blood, 
urine, or tissue samples. 

A further embodiment of the invention is 
selective binding molecules, such as monoclonal 
35 antibodies selectively binding OB receptor. The 
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hybridoma technique described originally by Kohler and 
Milstein Eur. J. Immunol. £, 511-519 (1976) has been 
widely applied to produce hybrid cell lines that secrete 
high levels of monoclonal antibodies against many 
5 specific antigens. Recombinant antibodies, ( see Huse et 
al., Science 24 6 : 1275 (1989)) may also be prepared. 
Such recombinant antibodies may be further modified, 
such as by modification of complementarity determining 
regions to increase or alter affinity, or "humanizing" 

10 such antibodies. Such antibodies may be incorporated 
into a kit for diagnostic purposes, for example. A 
diagnostic kit may be employed to determine the location 
and/or amount or OB receptor of an individual. 
Diagnostic kits may also be used to determine if an 

15 individual has receptors which bind OB protein, or those 
which, to varying degrees, have reduced binding capacity 
or ability. As stated infra , such antibodies may be 
prepared using immunogenic portions of an OB receptor 
protein. Such selective binding molecules may 

20 themselves be alternatives to OB protein, and may be 
formulated for pharmaceutical composition. 

Such proteins and/or nucleic acids may be used 
for tissue distribution assays (for example, as provided 
in the working example below) or for other assays to 

25 determine the location of OB receptor. 

The present OB receptor protein family may be 
used in methods to obtain OB protein analogs, mimetics 
or small molecules. One would simply prepare a desired 
OB receptor protein, particularly one with capability of 

30 binding to native OB protein, and assay the test 

molecule, which may be labelled with a detectable label 
substance, for ability to bind to such receptor. Other 
parameters, such as affinity, and .location of binding, 
may also be ascertained by methods available to those 

35 skilled in the art. For example, one could use portions 
of the present OB receptors, particularly portions in 



PCTAJS97/00128 

WO 97/25424 

- 42 - 

the extracellular domain which are necessary for ligand 
binding, to determine the location of such binding. One 
could prepare OB receptors which have various 
truncations or del tions of regions of the extracellular 
5 domain which could be used to determine the location of 
test molecule binding. One could use an OB receptor 
known to be defective in native OB binding, such as 
potentially one from an individual having such defective 
receptors, and use this as the basis for ascertaining OB 
10 protein which would be effective to result in desired 
biological activity (i.e., weight loss, reduction in 
blood dyslipidemias or lowering of cholesterol levels, 
reduction in incidence or severity of diabetes) . Other 
uses include solely cosmetic uses for alteration of body 
15 appearance, particularly the removal of fat. 

The present OB receptor protein or nucleic 
acids may also be useful to identify substances which 
"up-regulate" OB protein or receptor. For instance, the 
temporal expression of OB receptor in Yi^a may be useful 
2 0 to determine if an administered substance causes an 

increase or decrease in OB receptor. One may conclude 
that an increase in OB receptor expression results in 
modultion of weight or lipid metabolism. 

The divergence in the C-terminus may represent 
25 OB receptors with different signal transduction 

abilities. Therefore the different receptor family 
members may be used for different assays, depending on 
the type of signal transduction observed. It is thought 
that at least a portion of the intracellular domain is 
30 necessary for signal transduction ftUprfl ) • 

The following examples are offered to more 
fully illustrate the invention, but are not to be 
construed as limiting the scope thereof. 
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EXAMPLE 1 : IDENTIFICATION OF HUMAN OB RECEPTOR PROTEIN 

Human OB receptor protein DNA was identified 
5 in a human liver cDNA library in two steps. The first 
step used two primers in polymerase chain reaction (PCR) 
to amplify a selected 300 base pair region from the 
human liver cDNA library. The second step used the PCR 
fragment as a probe to screen the human liver cDNA 

10 library. Thirteen clones were obtained, but these were 
incomplete at the 5* end. A procedure was performed to 
complete the 5' end to make complete clones. Twelve 
clones were sequenced. These twelve clones were 
identified as either "A", "B" or "C" as denoted by the 

15 C-terminus of the predicted amino acid sequence. 

Polymerase Chain React ion . 

The original PCR primer was based on the 5 f 
end and the 3* end of a 416 base pair sequence having 
20 GenBank Database Accession No. T7384 9. This sequence 
was selected on the basis of a known motif present in 
cytokine receptors, "WSXWS". 

The 5* primer had the sequence 73-96 of the 
416 bp sequence. The 3 f primer had the sequence 337-360 
25 of the 416 bp sequence. 

These primers were used to probe a human cDNA 
liver library (Stratagene) . Standard methods were used. 

This resulted in a PCR fragment having the 
sequence 73-360 of the 416 bp fragment. 

30 

Hybridization . 

The 300 bp PCR fragment was used to probe a 
human liver cDNA library (Stratagene) using standard 
methods. This second hybridization resulted in 13 
35 positive clones. These were partial clones, incomplete 
at the 5' end. 
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^nmplfition nf f.he 5' end. 

Rapid Amplification of cDNA End ( "RACE" , kit, 
GIBCO/BRL) was used to obtain the full length clones. 

Sequencing results. 

Sequencing revealed the three types of OB 
receptor DNAs . Of the thirteen clones, 4 clones were 
the "A" type (Seq. ID Nos. 1 and 2); 1 clone was the "B" 
10 type (Seq. ID Nos. 3 and 4) and 4 clones were of the "C 
type (Seq. ID Nos. 5 and 6). 

As can be seen from the Sequence 
Identifications (below), OB receptor A is 896 amino 
acids long, "B" is 904 amino acids long, and "C" is 958 
15 amino acids long. These different OB receptors are 
identical at amino acid positions 1-891, and diverge 
almost completely beginning at position 892. The leader 
sequence is postulated to be, by hydrophobicity 
analysis, amino acids 1-21 <M-A) , 1-22 (M-F) or 1-28 (M-I), 
20 with the mature protein beginning at positions 22(F), 
23 (N) or 29 (T) . Based on hydrophobicity analysis, the 
leader sequence is most likely to be at positions 1-21 (M 
through A) . Chinese Hamster Ovary Cell ("CHO") cell 
production of the secreted form of OB receptor protein 
25 also produced a protein having amino acid number 22 as 
the first amino acid of the mature protein. The 
transmembrane region is likely to begin at either 
position 840 (A) or 842 (L) through position 862(1) , 
863 (S) or 864 (H). For OB receptor type "A", the last 
30 amino acid is located at position 896 and is a lysine 
(L) . For OB receptor type "B", the last amino acid is 
located at position 904 and is a glutamine (Q) . For OB 
receptor type "C", the last amino acid is located at 
position 958 and is glutamic acid (E) . 
35 For OB receptor protein type "C", the C- 

terminal region possesses high homology to a known human 
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transposable element. From nucleotide 2737 through 2947 
of the present human OB receptor protein type "C", there 
is a 98.1% homology with a 211 base section of a human 
retrotransposable element described in Ono et al., Nucl. 
5 Acids Res. 1£: 8725-8737 (1987) (bases 520 through 731, 
SINE-R11, GENBANK accession no. x07417) . 

EXAMPLE 2 : TISSUE DISTRIBUTION 

10 Tissue distribution was ascertained using two 

methods. The first method involved using the entire 
type "A" OB receptor. The second method involved using 
probes which are specific to the C-terminal region of 
the protein. Since these C terminal regions are diver- 

15 gent, the second method detected the tissue distribution 
of the different members of the OB receptor family. 
The first method used a Northern Blot kit 
(Clontech) , using the entire type A OB receptor DNA as a 
probe. The second method used PCR with primers specific 

20 to the nucleic acids encoding the divergent C terminus 
of the three types. Standard methods were used. 

Table 2 shows the results for the Northern 
Blot and the PCR methods. The "+" indicates the 
investigator's subjective determination of the strength 

25 of signal. For the Northern Blot analysis, a triple 

indicates that a result (a dark "band" on the X- 
ray film) was seen upon overnight exposure of the film. 
A double "++" indicates that bands were seen at two 
weeks of exposure. A single " + " indicates that the 

30 bands were seen after three weeks of exposure. In 

addition, using this method, two molecular weights were 
observed, one at 4 Kb and one at 6.2 Kb. Although 
distribution was ubiquitous, the strongest signals were 
seen for ovary, heart and liver. For the PCR analysis, 

35 OB receptor "A" was seen in all tissue types tested 

(prostate, ovary, small intestine, heart, lung, liver 
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and skeletal muscle) , type "B" was seen only in lung and 
liver, and type "C" was seen in ovary, heart, lung and 
liver . 

5 Table 2 

Ti««me p i«fcirlfaii+4oti of *>*ft Novel OB Receptor 





Northern Blot 




PCR 




4 Kb 


6.2 Kb 




A 


B 


C 


Spleen 




+ 










Thymus 


- 


+ 










Prostate 


- 


+ 




+ 


— 


— 


Testis 




♦ 










Ovary 




+ + + 




+ 




+ 


Small 
Intestine 




+ + 




+ 






Colon 














Peripheral 

blood 

Leukocyte 














Heart 




+ + + 








+ 


Brain 














Placenta 




+ 










Lunq 


+ 


+ + 




+ 


+ 




Liver 


+ + + 


+ + + 






+ 


+ 


Skeletal 
Muscle 




+ + 










Kidney 




+ + 










Pancreas 




+ 











reyAMPT.R, 3 : IDENTIFICATION OF HUMAN OB RECEPTOR GENOMIC 
DNA AND CHROMOSOME LOCALIZATION; IDENTIFICATION OF HUMAN 
OB RECEPTOR W D" 

15 

The full length human OB receptor genomic DNA 
was also prepared. OB receptor "A" cDNA, in its 
entirety, was used as a probe against a human genomic 
DNA library, using materials and methods from a 
20 commercially available kit (Genome Systems, using a 
human genomic library in a PI vector) . A single 
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positive clone was detected. There are introns located 
at (with respect to OB receptor "A" DNA) base pair 
number: 559, 1059, 1350, 1667, 1817, 1937, 2060, 2277, 
2460, 2662, and 2738. 
5 The human OB receptor gene was localized to 

human chromosome 1P31 by FISH analysis (Genome Systems) ., 
Human chromosome 1 is thought to correspond to mouse 
chromosome 4C7, which is presumed to be the location of 
the dt> locus. 

10 A further chromosomal sequence was isolated. 

This chromosomal DNA sequence was isolated from a human 
genomic library as described above. This chromosomal 
sequence encodes what is. here denominated human OB 
receptor "D", and the encoded amino acid sequence is set 

15 forth in SEQ. ID No. 7. A cDNA encoding this amino acid 
sequence is set forth in SEQ. ID No. 8. The chromosomal 
DNA intron/exon junction map is set forth as SEQ. ID 
No. 9. 

As with forms "A", "B", and ,B C", for the 
20 present form "D" OB receptor protein, the first amino 
acid of the mature protein is likely (using 
hydrophobicity analysis) to begin at position 22 (F) , 23 
(N) or 29 (T) . The last amino acid of the protein is at 
position 1165 and is a valine residue. As with the 
25 other forms, the extracellular domain extends from 

" position 22 (F) , 23 (N) or 29 (T) to position 839 (D) or 
841 (G) . The transmembrane domain appears to begin at 
position 84 0 (A) or 842 (L) . The end of the 
transmembrane domain appears to be located at position 
30 862 (I), 863 (S) or 864 (H) . The Oterminal region, 
beyond the transmembrane region, is likely to be 
involved in signal transduction, and is located at 
position 863 (S) , 864 (H) or 865 (Q) through position 
1165 (V) . 

35 The present OB receptor form "D" is identical 

to that published by Tartaglia et al, Cell £2.: 1263-1271 
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(December 29, 1995) with the exception of a single amino 
acid change at amino acid position 976 (nucleotide codon 
begining at position 3022) . The present type "D" amino 
acid at position 97 6 is aspartic acid, and the published 
5 amino acid corresponding to the same position is 

alanine. This is a non-conservative substitution, £££ 
infra , and since the location of the substitution is 
within a region thought important for signal trans- 
duction, this change could affect the function of the 
10 molecule. 

RXAMPLE 4 : PREPARATION OF SOLUBLE OB RECEPTOR 

Three forms of soluble human OB receptor have 
15 been prepared: 

1. Trader + Extrace llular Domain (Seq. 
ID Nos. 10 and 11): A recombinant form of the soluble 
human OB receptor was prepared. This form encompasses, 
in the immature protein, the leader sequence and the 
20 extracellular domain (amino acids 1-839) . The mature 

protein would have the leader sequence deleted, and the 
first amino acid of the mature recombinant soluble human 
OB receptor would be 22 (F) , 23 (N) or 29 (T) . This 
protein was expressed as described below. 
25 2 . T.^aripr + Extracellular Domain + C- 

tPrmin^l FLAG (Seq. ID No. 12) : A second form of the 
recombinant soluble human OB receptor was also prepared. 
This form had a "FLAG" tag located at the "C" terminus 
of the protein. The "FLAG" peptide is a useful research 
30 tool as it allows one to follow the protein using an 
antibody which recognizes the "FLAG" peptide. Such 
reagents are commercially available (IBI, New Haven, 
CT) . This protein was expressed as described below. 

3. Native Sp lirv* Variant (Seq. ID Nos. 
35 13 and 14) : This form is believed to the the 

recombinant form of a naturally occurring secreted, 
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soluble human OB receptor. This form has most of the 
amino acids found in the extracellular domain (amino 
acids 22-798) , and a unique 6 amino acid sequence at the 
carboxyl terminus. Beginning at amino acid position 799 
5 of Seq. ID No. 13, the amino acid sequence of this 

native splice variant human OB receptor protein is "G K 
F T I L." 

EXAMPLE 5 : PREPARATION OF EXPRESSION VECTORS 

10 

Recombinant human OB receptor expression 
vectors have been prepared for expression in mammalian 
cells. As indicated above, expression may also be in 
non-mammalian cells, such as bacterial cells. The type 

15 "A" cDNA (Seq. ID No. 2) was placed into a commercially 
available mammalian vector (pCEP4, Invitrogen) for 
expression in mammalian cells, including the commercial- 
ly available human embryonic kidney cell line, "293". 

Recombinant human OB receptor expression 

20 vectors have been prepared for expression of recombinant 
soluble OB receptor, consisting of the leader sequence 
and the extracellular domain (Seq. ID Nos . 10 and 11), 
using the same system as above (the commercialy 
available mammalian vector pCEP4, and ,, 293" cells) . 

25 This recombinant soluble human OB receptor was also 
expressed in CHO cells in a similar way. 

The "FLAG-tagged" form (Seq. ID No. 12) of the 
recombinant soluble human OB receptor, and the "D" form 
(Seq. ID No. 7) were also expressed in "293 M cells in a 

30 similar fashion as above. 

Detection of desired protein was accomplished 
using BIACORE (Pharmacia) analysis. This analysis is 
analogous to that described in Bartley et al., Nature 
36fi : 558-560 (1994) . 

35 Essentially, the BIACORE machine measures 

affinity interactions between two proteins. In this 
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case, the OB protein was immobilized on the machine, and 
conditioned media from cell lines expressing the OB 
receptor was added to the machine. Any receptor protein 
present in the conditioned media bound to the OB protein 

'5 surface. The BIACORE machine gave a read-out indicating 
that receptor protein, was being expressed. For 
recombinant soluble receptor (Seq. ID No. 10) expression 
in "293" cells, the read-out was 191.0 relative to a 
baseline readout of 0. For recombinant soluble receptor 

10 (SEq. ID No. 10) expression in CHO cells, the read-out 
was 150.9 relative to a baseline readout of 0. For 
recombinant soluble receptor with a C-terminal FLAG-tag 
(Seq. ID. No. 12), the read-out was 172.0 relative to a 
baseline of 0. 

15 For expression in bacterial cells, one would 

typically eliminate that portion encoding the leader 
sequence (e.g., potentially amino acids 1-21, 1-22 or 1- 
28) . One may add an additional methionyl at the 
N-terminus for bacterial expression. Additionally, one 

2 0 may substitute the native leader sequence with a 
different leader sequence, or other sequence for 
cleavage for ease of expression. 

F.XAMPL5 6 : DEMONSTRATION OF SIGNAL TRANSDUCTION 

25 

This example demonstrates that the "D" form is 
active to produce a signal within a cell, whereas in the 
same cell type, the "A" form does not. The signal 
transduction assay was performed by the use of "293" 

30 cells transiently expressing either the "A" or the "D" 
form (see above for preparation of the "293" expression 
clones) . Phosphorylation of molecules predicted to be 
involved in signal transduction within the cell was 
examined upon OB protein binding to the OB receptor 

35 protein tested. The results demonstrate that upon 

binding of OB protein to the extracellular domain, the 
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"D" form of the present OB protein rec ptor transduces a 
signal sufficient to initiate phosphorylation of 
signalling molecules. 

5 Methods 

1 . QB receptor molfimlp^ As indicated ' 
above, the f, A M form (Seq. ID No. 1) and the "D" form 
(Seq. ID. No. 7) were studied. 

2. Expression svstPm. The pCEP 4 system (as 
10 described above) having inserted DNA encoding the W A M 

form (Seq. ID No, 2) or the "D" form (Seq. ID No. 8) was 
used to transfect "293" cells. These cells did not 
allow for the pCEP4 vector to integrate into the genome, 
so such expression was transient. Non-recombinant 
15 (mock-transfected) cells were also prepared as controls. 

3. Detection of Phosphorylation. Mock 

transfected cells and cells expressing the "A" form or 
the "D" form were analyzed. Prior to treatment the 
cells were serum-starved by incubation in media with 

20 0.5% serum for 16 hours prior to the treatments. The 

cells were treated with the OB protein (10 mg/ml) for 15 
minutes at 37°C, after which the cells were lysed in 
modified NP40 buffer (50 mM Tris, pH 8.0, 150 mM sodium 
chloride, 1% NP40, 10 mg/ml aprotinin, 5mM EDTA, 200 mM 

25 sodium orthovanadate) . Phosphotyrosine containing 

proteins were immunoprecipitated (Anti-phosphotyrosine 
antibody 4G10, UBI, Lake Placid, NY) , and separated by 
SDS polyacrylamide gel electrophoresis. After 
electrophoresis and electroblotting to membranes the 

30 immunoprecipitates were probed with antibodies to 

various signal transduction molecules. Antibodies to 
STATs, JAKs and ERKs were purchased from Santa Cruz 
Biotechnology Inc. Immune complexes were detected by 
horseradish peroxidase conjugated secondary reagents 

35 using chemi luminescence as described by the manufacturer 
(ECL, Amersham) . As a positive control, 32D cells were 
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treated with IL-3, which is known to activate by 
tyrosine phosphorylation most of the molecules being 
analyzed. 

4. poults . Results are presented in Table 
5 3, below. As can be seen, only the "D" form was able to 
respond to either mouse or human OB protein as detected 
by phosphorylation of JAK and STAT molecules* A " + 
designation indicates signal was detected, a 
designation means that no signal was observed. 

10 

TABLE 3 



Signal 
/AB* 


293 
Alone 


293/D 
hrOB* 


293/D 
mrOB** 


293/A 
hrOB# 


293/A 
mrOB## 


32D 
IL-3 


STAT1 




+ 










STAT3 




+ 


+ 






+ 


STATS 




+ 


+ 






+ 


JAK1 




+ 


+ 






+ 


JAK 2 




+ 


+ 






+ 


JAK 3 














TYK2 




+ 


+ 








ERKS 
1,2 















* Antibody detection target 

* 2 93 cells expressing receptor form "D", treated with 
15 recombinant human OB 

** 293 cells expressing receptor form "D" treated with 
recombinant murine OB 

* 293 cells expressing receptor form "A" treated with 
recombinant human OB 

20 ## 293 cells expressing receptor form "A" treated with 
recombinant murine OB 

The "D" form is capable" of initiating 
signalling through the JAK/STAT pathways in 293 cells, 
whereas the "A" form cannot. 
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EXAMPLE 7 : USE OF SOLUBLE OB RECEPTOR AS A THERAPEUTIC 

This example demonstrates that soluble OB 
5 receptor protein acts to protect the activity of OB 
protein. Below, soluble OB receptor and/ or OB protein 
was delivered to a mammal via "gene transplant" — that 
is, via bone marrow cells engineered to express the 
desired DNAs . When soluble OB receptor combined with OB 

10 protein was delivered, the animals lost more weight than 
delivery of OB protein alone. This demonstrates the 
protective activity of OB receptor protein. 

While not wishing to be bound by theory, one 
explanation of the mode of action is that soluble OB 

15 receptor protein acts to protect the OB protein in serum 
from agents or conditions which could diminish its 
activity. The protective action appears to increase 
circulating half-life of the protein. As such, the 
present example demonstrates that OB receptor either 

20 alone, or administered as a complex with OB protein (or 
analog or derivative thereof) could act as a therapeutic 
agent . 

Mate-rials and methods: 

25 1. -Preparation of recombinant, nh retroviral 

vector Packaging Cells. 

Use of murine ob cDNA . Full length wild-type 

murine ob cDNA was amplified by the PCR using synthetic 

oligonucleotides designed from the published sequence 
30 Zhang et al., Nature 372: 425-432 (1994) .Linkers (An JEcq. 

RI linker and a Bol II linker) were used to facilitate 

subcloning. 

Use Of soluble recombinant human OB receptor 
cDNA . Methods similar to those above were used. A 
35 construct containing the recombinant human soluble 

receptor of Seq. ID No. 10 was used, and modified with 
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linkers to facilitate cloning (i.e., the addition of a 
bq! ii restriction endonuclease recognition site) . 

P^rfttnpnr. nf desired rDNA into vector. PCR 
products were digested with EcoRI and Bgill and cloned 
5 into similarly-digested parental vector (pMSCV2.1) under 
the transcriptional control of the viral LTR promoter. 
The parental MSCV vector (supplied by R. Hawley, 
University of Toronto, Canada) was derived from MESV 
(murine embryonic stem cell virus) and contains a 
10 neomycin phosphotransferase resistance (neo r ) gene 
driven by an internal mouse phosphoglycerate kinase 
(PGK) promoter, as described- Hawley, et al, J. Exp. 
Med. I2£: 1149 -1163 (1992) . The parental plasmid 
pMSCV2.1 arid pMSCV-OB were independently electroporated 
15 into the GP+E-86 packaging cell line (supplied by Dr. A. 
Bank, Columbia University, NY) Markowitz et al.,. J. 
Virol. £2:1120-1124 (1988). Transient supernatants were 
harvested from electroporated populations and used to 
infect tunicamycin treated parental GP+E-86 cells. 
2 0 Tunicamycin treatment relieves the block to 

superinfection of the parental packaging cells. G418 
(0.78 mg/mL, 67% active, GIBCO Laboratories, Life 
Technologies, Inc., Grand Island, NY) resistant clones 
were selected from each infected population and titered 
25 by infection of NIH3T3 cells. Clones with the highest 
G418 resistant titer were expanded and frozen as 
aliquots. Each bone marrow infection and 
transplantation experiment used aliquots from the same 
passage of frozen viral packaging cells. Both the 
30 parental and ob packaging cell lines were tested for th< 
presence of, and found to be free from, replication 
competent virus using a sensitive marker rescue assay. 
Moore, et al., (1993) in: Gene Targeting: A Practical 
Approach, Joyner, Ed. (Oxford University Press, New 
35 York, NY) . 
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2 . Production of Retroviral Supernatants. 
Recombinant virus-producing packaging cell lines were 
grown in 175cm 2 tissue culture flasks in Iscove's 
Modified Dulbecco's Medium (IMDM) (GIBCO) , 10% (v/v) 

5 FBS, at 37°C. Sub-confluent (approximately 60%) 

monolayers of cells were fed with fresh medium 24h prior 
to harvest of virus-containing supernatants . Viral 
supernatants were removed from packaging cell lines by 
aspiration, sterile filtered (0.45mM) and added directly 
10 to bone marrow cultures. Fresh aliquots of frozen 
packaging cell lines were thawed for use in each 
experiment . 

3. Bone Marrow Infection and Transplant.at i nn . 
Eight to 12-week old female C57BL/6J ( + / + ) or (ob/ob) 

15 mice were used as bone marrow donors and recipients. 

All mice were purchased from The Jackson Laboratory (Bar 
Harbor, ME) and housed under specific pathogen-free 
conditions in a vivarium in accordance with governmental 
regulations and institutional guidelines. 
20 Bone marrow cells were harvested from femurs 

and tibias of donor mice 4 days post 5-f luorouracil (5- 
FU, Sigma Chemical Co., St. Louis, MO) treatment (150 
mg/kg i.v.). Bone marrow cells (6 X 10 5 /mL) were 
incubated in 150mm tissue culture dishes (30mL/dish) 
25 containing fresh viral supernatant (as described above) , 
15% FBS, 6 mg/mL polybrene (Sigma), 0.1% bovine serum 
albumin (BSA, Fraction V, Sigma), 2.5 ng/mL recombinant 
mouse IL-3 (rmIL-3) , 100 ng/mL each of recombinant human 
IL-6 (rhIL-6) , recombinant human IL-11 (rhIL-11) , and 
30 recombinant rat SCF (rrSCF) . . All growth factors were 
produced by Amgen, Inc. (Thousand Oaks, CA) . Culture 
media were replaced daily for 3 days with fresh virus- 
containing supernatant and growth . factors . 

At the end of the infection period, total non- 
35 adherent and adherent cells were washed and resuspended 
in 1% BSA-saline and transplanted into g-irradiated (12 
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Gy, Cs 137 ) mice. Each animal was transplanted with 2.5 

X 10 6 syngeneic cells. There were approximately 10 

animals per cohort. 

4 . Analysis nf OR prot.pin expression in 

5 tran^fpcf pri rPllg *nH transnlant.ed animals. For 

transfected bone marrow cells. Western analysis was 
. performed. Vector packaging cell supernatant was 
resolved by SDS-PAGE (16% acrylamide) , then transferred 
to Hybond-ECL (Amersham, Arlington- Heights, ID . The 
10 filter was incubated with affinity-purified rabbit a- 
mouse OB protein polyclonal antibody (lmg/mL) in T-TBS 
buffer (20mM Tris-chloride, pH7.6, 137mM NaCI, 0.1% 
Tween20) at room temperature for 45 min. Horseradish 
peroxidase (HRP) -con jugated donkey a-rabbit IgG 
15 (Amersham) was diluted in T-TBS (1:2500) and incubated 
with the filter at room temperature for 45 min. 
Enhanced chemiluminescence (ECL, Amersham) detection was 
performed as recommended by the manufacturer. 

For transplanted animals, serum was analyzed. 
20 Animals were bled retroorbitally, under isofluorane 

anesthesia. Serum from transplanted ob/ob animals was 
resolved by SDS-PAGE (4-20% acrylamide) under non- 
reducing and reducing conditions, then transferred to 
Trans-Blot (Bio-Rad Laboratories, Hercules, CA) 
25 membranes. The membranes were incubated for 2 hours at 
room temperature with HRP-con jugated rabbit a-mouse OB 
protein antibody (0.125mg/mL) in T-TBS buffer containing 
5% fetal bovine serum and 1% bovine serum albumin. 
Bound OB protein was detected by ECL (Amersham) , 
30 performed as recommended by the manufacturer. 

For quantitation of soluble OB protein levels, 
serum from transplanted animals was subjected to ELISA 
analysis. Briefly, affinity-purified rabbit a-OB 
protein polyclonal antibody was coated onto 96-weIl 
35 plates. Standards (purified recombinant OB protein 
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monomer, Pelleymounter et al., Science 269 : 540-543 
(1995) and experimental samples were added, and the 
plates were incubated at room temperature. The plates 
were washed twice and affinity-purified rabbit a-OB 
5 protein antibody conjugated to horseradish peroxidase 
was added. Following incubation at room temperature, 
the plates were washed four times with TNE-Tween20 . 
TMB/peroxide substrate was added and the color reaction 
was read at 450nm in a Molecular Devices plate reader. 

10 OB protein concentrations in sera were estimated by 

comparison to a standard curve prepared from internal 
standards. OB protein levels were reliably measured in 
samples containing >160 pg/mL. 

5. Body Weight and Food Intake. Mice were 

15 offered pelletized rodent chow (PMI Feeds, Inc., St, 
Louis, MO) ad libitum . The body weight of individual 
animals was measured daily for the first two months of 
analysis, and weekly thereafter. Food consumption was 
measured daily on selected groups of individually-housed 

20 animals. 

Results 

Results are presented in Tables 4 and 5 below. 
Administration of OB protein receptor increased the 

25 effectiveness of OB protein. This may have been 

accomplished via an increased circulation time of OB 
protein in the presence of OB protein receptor. 

As can be seen in the Table, animals 
administered a combination of OB protein and OB protein 

30 receptor (via genetic therapy) had a greater weight loss 
after 28 days than either composition alone. The Table 

presents the results of two experiments (" / ") . As 

can be seen, use of the OB protein alone at day 40 
resulted in animals with 87.5% and 72.2% of the starting 

35 weight. Using OB receptor in combination with OB 
protein, however, resulted in animals with 68% and 
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53.6% of the starting weight. Use of the receptor alone 
appeared to have little effect, if any. 

TABLE 4 



Treatment 


Weight (g) 
decrease at 
day 28 (ave) 


% starting 
weight 
(ave) 
dav 28 


% starting 
weight 
(ave) 
dav 40 


OB alone* 


6.3/12.7 


87.9/75.3 


87.5/72.2 


Receptor** 
alone 


[1.4]/[0.3] 


103/100.6 


104.2/101.7 


OB + 
Receptor*** 


12.6/16.8 


76.3/67.5 


68/53.6 



* 50% bone marrow cells trans fected with OB protein 
cDNA as described above, and 50% bone marrow cells 
without genetic alteration 

** 5 0% bone marrow cells transfected with OB receptor 
10 protein cDNA as described above, and 50% bone marrow 
cells without genetic alteration 

*** 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
transfected with OB receptor protein cDNA as described 
15 above. 

Table 5, below, contains results of the OB 
levels found in the serum from animals administered OB 
protein alone, or administered OB protein in combination 
with OB protein receptor (via the "gene therapy" method 
20 of this example) . The data reflect nanograms of OB 
protein per milliliter of serum, plus or minus the 
standard error of the mean. 
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TABLE 5 



Treatment 


Experiment #1* 


Experiment #2** 


OB alone* 


2.93 +/- 0.77 


9.74 +/- 1.02 


Receptor** 
alone 


0.08 +/- 0.05 


0.12 +/- 0.07 


OB + 
Receptor*** 


12.11 +/- 1.90 


15.18 +/- 2.52 



50% bone marrow cells transfected with OB protein 



5 cDNA as described above, and 50% bone marrow cells 
without genetic alteration 

** 50% bone marrow cells transfected with OB receptor 
protein cDNA as described above, and 50% bone marrow 
cells without genetic alteration 
10 *** 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
transfected with OB receptor protein cDNA as described 
above . 

* Experiment #1 was conducted as described above, 
15 with OB protein serum levels measured after 38 days. 
tt Experiment #2 was also conducted as described 
above, with OB protein serum levels measured after 24 
days . 

The "data demonstrate the protective effects of 
20 OB receptor. As can be seen, in the presence of OB 

receptor, OB protein has a higher accumulation in the 
serum. The degree of accumulation is observed to 
increase inversely with the levels of OB protein in the 
serum. In Experiment #1 (with a base OB protein level 
25 of about 2.93 ng/ml) , the OB protein serum level 

increased about 400% with the addition of receptor, 
where in Experiment #2 (with a base of about 9.74), the 
OB protein serum level increased by about 25%. 

OB receptor administered either alone or in 
30 association with OB protein (or analogs or derivatives 
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thereof) may serve to increase the circulation time of 
OB protein, and therefore enhance the therapeutic 
efficacy of either exogenous or endogenous OB protein. 

5 KX AMPLE 8 : PREPARATION OF SELECTIVE BINDING MOLECULES 
Animals were immunized for the preparation of 
polyclonal antibodies using the following peptides (with 
respect to the numbering of the amino acids for OB 
receptor A, Seq. ID No, 1) : 54-64; 91-100; 310-325; 
10 397-406; 482-496; 874-885; and, with respect to amino 

acids of OB receptor "C" (Seq. ID No. 5), 910-929. Some 
of the polyclonal antibodies prepared (in rabbits) were 
tested for ability to bind to recombinant human OB 
receptor protein. The polyclonal antibody prepared 
15 against amino acids 54-64 was found to have the highest 
affinity for recombinant human OB receptor protein. The 
polyclonal antibody prepared against amino acids 397-406 
was also found to bind to recombinant human OB receptor 
protein. The polyclonal antibody prepared against amino 
20 acids 91-10 0 was found to slightly bind to recombinant 
human OB receptor protein. The polyclonal antibody 
prepared against amino acids 874-885 was found not to 
bind to recombinant human OB receptor protein. 

An additional study was performed which 
25 demonstrates the expression and purification of the 

extracellular domain of the OB receptor protein in CHO 
cells, and antibodies which recognize this OB protein 
receptor extracellular domain. 

The extracellular domain of the human OB 
30 receptor protein was expressed as a secreted, soluble 
protein in CHO cells as previously described supra . 
Individual cell lines were isolated and grown in 
increasing amounts of methotrexate to increase 
selection/expression of the recombinant receptor protein 
35 (100, 200 or 500 micrograms methotrexate per ml of 

media) . Conditioned media from the CHO cell lines was 
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collected, and the proteins in the conditioned media 
were fractionated by SDS-PAGE. The OB receptor 
extracellular domain migrated as a broad band with an 
apparent size range of about 140 kDa to about 200 kDa. 
5 The OB receptor protein extracellular domain was 
detected by Western Blot analysis using polyclonal 
antibodies prepared against a portion of the extracel- 
lular domain of the OB receptor protein. The unfolded, 
bacterially expressed protein was used as an antigen to 

10 generate antisera in rabbits. The identified OB receptor 
extracellular domain was purified by affinity 
chromatography. The purified protein was sequenced at 
the amino terminus to confirm that it was the OB 
receptor and also to determine the start of the mature 

15 protein (after signal peptide cleavage) as expresed in 
CHO cells. It was found that amino acid no. 22 
(according to the amino acid sequence numbering of Seq. 
ID No. 1, infra ) , was the first amino acid of the mature 
protein as expressed in CHO cells. 

20 Other immunogenic peptides may be used. 

Polyclonal, monospecific polyclonal, monoclonal, 
antibody fragments, and recombinant antibodies may be 
prepared using methods available to those skilled in the 
art . 

25 One may further use recombinant techniques or 

peptide synthesis methods to alter the character of such 
selective binding molecules. This may be accomplished 
by preparing recombinant antibodies having altered 
complementarity determining regions (sometimes referred 

30 to in the art as "CDR's") to, for example "humanize" the 
antibodies by using human F c (constant) regions. Other 

types of recombinant antibodies, for example, those 
having CDR's altered to enhance affinity or selectivity 
to one or more members of the OB receptor family, may be 
35 prepared and used using methods available to those 
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skilled in the art. Sfifi Winter et al . , Nature 293- 
299 (1991) . 

The present OB receptor protein may be used as 
an assay to screen for desired selective binding 
5 molecules. Such assay may be based on binding 

capability, or biological activity, or, other means of 
detecting signal transduction. For example, if one were 
to prepare a series of modified antibodies, one could 
test them for affinity (i.e, binding strength) against 
10 the target OB receptor. 

The selective binding molecules may be useful 
for diagnostic purposes, such as tissue distribution 
analysis, or to diagnose the relative affinity of an 
individual's OB receptors for such selective binding 
15 molecule to determine the functionality of an 

individual's OB receptor during a course of therapy. 
Selective binding molecules may be alternative 
therapeutic or cosmetic products to OB protein. 

20 KYAMPT.R 9 ; ^NF. THERAPY 

One may deliver the present OB receptor 
protein via gene therapy, as described i nfra . 

One may envision, using materials and methods 
available to those skilled in the art and provided 
25 herein, using T-cells as an agent carrying DNA 

expressing OB receptor for gene therapy. An individual 
would have T-cells selected using CD34 + selection and a 
magnetic microparticles selection device. Such cells 
would be transfected with the desired DNA, or the 
30 regulation of the desired coding region may be altered 
using homologous recombination or other in s i tu 
techniques. The transduced cells could be selected 
empirically, using means to detect the desired protein, 
or a marker may be included which permits indirect 
35 detection (i.e., a selectable marker as is known in the 
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art) . Optionally, such cells could be expanded, for 
example, using one or more growth factors such as SCF or 
an interleukin, and such cells could be stored for 
future use. In such a way, the procedure would only 
5 have to be accomplished once or infrequently in an 
individual's lifetime, for later transfer into the 
individual. The cells would be re-planted into the 
individual, and the individual would be monitored for 
desired therapeutic effect, such as weight 
10 loss/maintenance of weight, diabetes recurrence, blood 
lipid levels, or other conditions* 

Illustrative Nucleic Acid and Amino Acid Sequences 

The below amino acid and DNA sequences are 
15 those to which reference has been made. An asterick ("*" ) 
indicates the position of a stop codon. 
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H upan OB pprpprnr "A" Amino Acid Sequence (Seq. ID NO . 1 (Amino 
A^-iH. siP fTio ipt.tpr abbreviation); 

1 MI CQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP NSTYDYFLLP 
51 AGLSKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC FRSEQDRNCS 
101 LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF ICYVESLFKN 
151 LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 
201 PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP LGLHMEITDD 
251 GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA TSLLVDSILP 
301 GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI LTSVGSNVSF 
351 HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK VTFFNLNETK 
401 PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG YLTKMTCRWS 
451 TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL QSDGFYECIF 
501 QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP SSVKAEITIN 
551 IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD AKSKSVSLPV 
601 PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM RGPEFWRIIN 
651 GDTMKKEKNV TLLWKP LMKN DSLCSVQRYV INHHTSCNGT WSEDVGNHTK 
701 FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI VQSLSAYPLN 
751 SSCVIVSWIL SPSDYKLMYF IIEWKNLNED GEIKWLRISS SVKKYYIHDH 
801 FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSDA GLYVIVPVII 
851 SSSILLLGTL LISHQRMKKL FWEDVPNPKN CSWAQGLNFQ KRTDIL*SLI 
901 MITTDEPNVP TSQQSIEY*K IFTF*RRGAN LKKIQLNF*E LTYGGLC*FR 
951 T*NRCVNLGS KCRFESSLDV *L 
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Human 


OB Receptoi 


• "A" DNA Seouence fSea. ID No. 2 


(DNA)) : 


l 


CCGCCGCCAT 


CTCTGCCTTC 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


51 


CTTCTCTGAA 


GTAAGATGAT 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


101 


TTGGGAATTT 


ATTTATGTGA 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


151 


CTCCTTGGAG 


ATTTAAGTTG 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


201 


TACTTCCTTT 


TGCCTGCTGG 


ACTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


251 


ACATTATGAG 


ACAGCTGTTG 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


301 


TTTCTAACTT 


ATCCAAAACA 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


351 


GATAGAAACT 


GCTCCTTATG 


TGCAGACAAC 


ATTGAAiGGAA 


AGACATTTGT 


401 


TTCAACAGTA 


AATTCTTTAG 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


451 


TACAGTGCTG 


GCTAAAAGGA 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


501 


TCAT TATTTA 


AGAATCTATT 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


551 


ATATGTTCTG 


CCTGAAGTGT 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


601 


GCAGTTTTCA 


GATGGTTCAC 


TGCAATTGCA 


GTGTTCATGA 


ATGTTGTGAA 


651 


TGTCTTGTGC 


CTGTGCCAAC 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


701 


TTTGAAAATC 


ACATCTGGTG 


GAG T AAT T T T 


CCAGTCACCT 


CTAATGTCAG 


751 


TTCAGCCCAT 


AAATATGGTG 


AAGCCTGATC 


C AC CAT TAG u 


TTTGCATATG 


801 


GAAATCACAG 


ATGATGGTAA 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


851 


GGTACCATTT 


CCACTTCAAT 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


901 


C AGT TATCAG 


AGAAGCTGAC 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 


951 


GACAGTATAC 


TTCCTGGGf C 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


1001 


ACTGGATGGC 


CCAGGAATCT 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


1051 


CCACACAAGA 


TGTCATATAC 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


1101 


TCTAATGTTT 


CTTTTCACTG 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


1201 


GCCAGTATGA 


TGTTGTGAGT 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 


1251 


CTGAATGAAA 


CCAAACCTCG 


AGGAAAGTTT 


ACCTATGATG 


CAGTGTACTG 
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1301 CTGGAATGAA CATGAATGCC ATCATCGCTA TGC TGAATT A TATGTGATTG 
1351 ATGT CAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG 
1401 ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT 
1451 GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
1501 TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATT TGCAGAG TGATGGTTTT 
1551 TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG 
1601 GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 
1651 TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
1701 GAAATTACTA TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCC AG T 
1751 CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
1801 AAGAAGTACA ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT 
1851 GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
1901 CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 
1951 CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
2001 TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT 
2051 ACTTTGGAAG CCCCTGATGA AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
2101 ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG 
2151 GGAAATCACA CGAAAT TCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT 
2201 TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 
2251 TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
2301 GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC 
2351 CAGTGATTAC AAGCTAATGT ATTTTATTAT TGAGTGGAAA AATCTTAATG 
2401 AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
2451 TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
2501 CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 
2551 CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT 
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2601 


GTGCCAGTAA 


2651 


ATCACACCAA 


2701 


AGAATTGTTC 


2751 


CTTTGAAGTC 


2801 


CCAACAGTCT 


2851 


CAAATCTAAA 


2901 


TTATGTTGAT 


2951 


TAGATTTGAG 


3001 


TTTAAAAGTA 


3051 


TCATAAAACA 


3101 


AAATGTCATC 


3151 


AACTGCAACA 
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TTATTTCCTC TTCCATCTTA 
AGAATGAAAA AGCTATTTTG 
CTGGGCACAA GGACTTAATT 
TAATCATGAT CACTACAGAT 
ATAGAGTATT AGAAGATTTT 
AAAAATTCAG TTGAACTTCT 
TTAGAACTTA AAATAGATGT 
TCCAGTTTGG ATGTGTGATT 
GTATTCATGA TTTCTGGCTT 
TTAAGAAAAT TATGGCTGTT 
AAATATGTAG TAGACAATTT 
TCTGACAAAT TGCTTTAAAA 



TTGCTTGGAA CATTATTAAT 
GGAAGATGTT CCGAACCCCA 
TTCAGAAGAG AACGGACATT 
GAACCCAATG TGCCAACTTC 
TACATTTTGA AGAAGGGGAG 
GAGAGTTAAC ATATGGTGGA 
GTAAATTTGG GTTCAAAATG 
AATTTTCAAA TCATCTAAAG 
TTGATTTGCC ATATTCCTGG 
GCTGTCATTA CATATCTATT 
TGTAATTAGG TGAACTCTAA 
ATACAATGAT TAT 
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Human OB Rprsntor "R" Amino Arid Sequence (Seq. IP No. 3 (Amino 

Acid) ) : 

1 MICQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP NSTYDYFLLP 

51 AGLSKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC FRSEQDRNCS 

101 LCADNIEGKT FVSTVNSLVF QQ1DANWNIQ CWLKGDLKLF ICYVESLFKN 

151 LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 

201 PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP LGLHMEITDD 

251 GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA TSLLVDSILP 

301 GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI LTSVGSNVSF 

351 HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK VTFFNLNETK 

401 PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG YLTKMTCRWS 

451 TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL QSDGFYECIF 

501 QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP SSVKAEITIN 

551 IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD AKSKSVSLPV 

601 PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM RGPEFWRI IN 

651 GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT WSEDVGNHTK 

701 FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI VQSLSAYPLN 

751 SSCVIVSWIL SPSDYKLMYF IIEWKNLNED GEIKWLRISS SVKKYYIHDH 

801 FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSDA GLYVIVPVII 
851 SSSILLLGTL LISHQRMKKL FWEDVPNPKN CSWAQGLNFQ KKRLSIFLSS 
901 IQHQ*HWLF FWSLKQFQKI SVLIHHGKIK MR*CQQLWSL YFQQQILKRV 
951 LFVLVTSSTV LTSLRLRVLR *PMRTKARDN PLLNTPR-SA TLNQVKLVK 
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Human OB Rfrppto r "B" DNA SpmienrP (Sfa. ID No. 4 (DKA1 ) ; 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1 


CCGCCGCCAT 


CTCTGCCTTC 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


51 


CTTCTCTGAA 


GTAAGATGAT 


TTGTC4WAAA 


TTCTGTGTGG 


TTTTGTTACA 


101 


TTGGGAATTT 


ATTTATGTGA 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


151 


CTCCTTGGAG 


ATTTAAGTTG 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


201 


TACTTCCTTT 


TGCCTGCTGG 


ACTCTCAAAG 


AATACTTCAA 


AT TCGAATGG 


251 


ACATTATGAG 


ACAGCTGTTG 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


301 


TTTCTAACTT 


ATCCAA*AACA 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


351 


GATAGAAACT 


GCTCCTTATG 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


401 


TTCAACAGTA 


AATTCTTTAG 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


451 


TACAGTGCTG 


GCTAAAAGGA 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


501 


TCATTATTTA 


AGAATCTATT 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


551 


ATATGTTCTG 


CCTGAAGTGT 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


601 


GCAGTTTTCA 


GATGGTTCAC 


TGCAATTGCA 


GTGTTCATGA 


ATGTTGTGAA 


651 


TGTCTTGTGC 


CTGTGCCAAC 


AGCCA^CTC 


AACGACACTC 


TCCTTATGTG 


701 


TTTGAAAATC 


ACATCTGGTG 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


751 


TTCAGCCCAT 


AAATATGGTG 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


801 


GAAATCACAG 


ATGATGGTAA 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


851 


GGTACCATTT 


CCACTTCAAT 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


901 


CAGTTATCAG 


AGAAGC TGAC 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 


951 


GACAGTATAC 


TTCCTGGGTC 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


1001 


ACTGGATGGC 


CCAGGAATCT 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


1051 


CCACACAAGA 


TGTCATATAC 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


1101 


TCTAATGTTT 


CTTTTCACTG 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


GGATGAATTT 


. AGCTGAGAAA 


ATTCCTCAAA 


1201 


GCCAGTATGA 


TGTTGTGAGT 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 


1251 


CTGAATGAAA 


CCAAACCTCG 


AGGAAAGTTT 


ACCTATGATG 


CAGTGTACTG 
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1301 CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 
1351 ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG 
1401 ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT 
1451 GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
1501 TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATT TGCAGAG TGATGGTTTT 
1551 TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG 
1601 GATTAGGATC AATCAC TCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 
1651 TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
1701 GAAATTACTA TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT 
1751 CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
1801 AAGAAGTACA ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT 
1851 GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
1901 CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 
1951 CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
2001 TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT 
2051 ACTTTGGAAG CCCCTGATGA AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
2101 ATGT GAT AAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG 
2151 GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT 
2201 TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 
2251 TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
2301 GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC 
2351 CAGTGATTAC AAGCTAATGT ATTTTATTAT TGAGTGGAAA AATCTTAATG 
2401 AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
2451 TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
2501 CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 
2551 CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT 
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10 



2601 


GTGCCAGTAA 


2651 


ATCACACCAA 


2701 


AGAATTGTTC 


2751 


ATCTTTTTAT 


2801 


GAGCCTGAAA 


2851 


TAAAGATGAG 






2951 


TTCTCTGAGG 


3001 


ACAACCCTTT 


3051 


AAACTGGTGA 
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TTATTTCCTC TTCCATCTTA 
AGAATGAAAA AGCTATTTTG 
CTGGGCACAA GGACTTAATT 
CAAGCATACA GCATCAGTGA 
CAATTTCAGA AGATATCAGT 
ATGATGCCAA CAACTGTGGT 
GGGTTCTGTT TGTTTTAGTG 
CTGAGGGTAC TGAGGTAACC 
GTTAAATACG CCACGCTGAT 
AGA 



TTGCTTGGAA CATTATTAAT 
GGAAGATGTT CCGAACCCCA 
TTCAGAAGAA ACGTTTGAGC 
CATGTGGTCC TCTTCTTTTG 
GTTGATACAT CATGGAAAAA 
CTCTCTACTT TCAACAACAG 
ACCAGTTCAA CAGTGTTAAC 
TATGAGGACG AAAGCCAGAG 
CAGCAAtTCT AAACCAAGTG 
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Human OB PprPnmr " (7" Amino Arid Sequence (Sffq , TP NO . 5 (ftm l 

1 MICQKFCWL LHWEFIYVIT AFNLSYPITP WRFKLSCMPP NSTYDYFLLP 

51 AGLSKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC FRSEQDRNCS 

101 LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF ICYVESLFKN 

151 LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 

201 PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP LGLHMEITDD 

251 GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA TSLLVDSILP 

301 GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI LTSVGSNVSF 

351 HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK VTFFNLNETK 

401 PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG YLTKMTCRWS 

451 TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL QSDGFYECIF 

501 QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP SSVKAEITIN 

551 IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD AKSKSVSLPV 

601 PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM RGPEFWRI IN 

651 GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT WSEDVGNHTK 

701 FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI VQSLSAYPLN 

751 SSCVIVSWIL SPSDYKLMYF I I EWKNLNED GEIKWLRISS SVKKYYIHDH 

801 FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSDA GLYVIVPVII 

851 SSSILLLGTL LISHQRMKKL FWEDVPNPKN CSWAQGLNFQ KMLEGSMFVK 

901 SHHHSLISST QGHKHCGRPQ GPLHRKTRDL CSLVYLLTLP PLLSYDPAKS 

951 PSVRNTQE*S IKKKKKKLEG 
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Human QB Rereot.or "C" DNA SeauPnrP (Sf>a. ID No. 6 (DNAM ; 
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1 


CCGCCGCCAT 


CTCTGCCTTC 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


51 


CTTCTCTGAA 


GTAAGATGAT 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


101 


TTGGGAATTT 


ATTTATGTGA 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


151 


CTCCTTGGAG 


ATTTAAGTTG 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


201 


TACTTCCTTT 


TGCCTGCTGG 


ACTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


251 


AC AT TAT GAG 


ACAGCTGTTG 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


301 


TTTCTAACTT 


ATCCAAAACA 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


351 


GATAGAAACT 


GCTCCTTATG 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


401 


TTCAACAGTA 


AATTCTTTAG 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


451 


TACAGTGCTG 


GCTAAAAGGA 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


501 


TCATTATTTA 


AGAATCTATT 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


551 


ATATGTTCTG 


CCTGAAGTGT 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


601 


GCAGTTTTCA 


GATGGTTCAC 


TGCAATTGCA 


GTGTTCATGA 


ATGTTGTGAA 


651 


TGTCTTGTGC 


CTGTGCCAAC 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


701 


TTTGAAAATC 


ACATCTGGTG 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


751 


TTCAGCCCAT 


AAATATGGTG 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


801 


GAAATCACAG 


ATGATGGTAA 


TTTAAAGATT 


TCTTGG.TCCA 


GCCCACCATT 


851 


GGTACCATTT 


CCACTTCAAT 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


901 


CAGTTATCAG 


AGAAGCTGAC 


AAGATTGrTCT 


CAGCTACATC 


CCTGCTAGTA 


951 


GACAGTATAC 


TTCCTGGGTC 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


1001 


ACTGGATGGC 


CCAGGAATCT 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


1051 


CCACACAAGA 


TGTCATATAC 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


1101 


TCTAATGTTT 


CTTTTCACTG 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


GGATGAATTT 


AGC TGAGAAA 


ATTCCTCAAA 


1201 


GCCAGTATGA 


TGTTGTGAGT 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 
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1251 CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG 
1301 CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 
1351 ATGT CAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG 
1401 ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT 
1451 GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
1501 TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT 
1551 TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG 
1601 GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 
1651 TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
1701 GAAATTACTA TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT 
1751 CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
1801 AAGAAGTACA ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT 
1851 GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
1901 CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 
1951 CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
2001 TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT 
2051 ACTTTGGAAG CCCCTGATGA AAAATGACTC A7TGTGCAGT GTTCAGAGAT 
2101 ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG 
2151 GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT 
2201 TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 
2251 TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
2301 GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC 
2351 CAGTGATTAC AAGCTAATGT ATTTTATTAT TGAGTGGAAA AATCTTAATG 
2401 AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
2451 TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
2501 CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 
2551 CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT 
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10 



2601 


GTGCCAGTAA 


2651 


ATCACACCAA 


2701 


AGAATTGTTC 


2751 


AGCATGTTCG 


2801 


ACACAAACAC 


2851 


ACCTTTGTTC 


2901 


GACCCTGCCA 


2951 


AAAAAAAAAA 
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TTATTTCCTC TTCCATCTTA 
AGAATGAAAA AGCTATTTTG 
CTGGGCACAA GGACTTAATT 
TTAAGAGTCA TCACCACTCC 
TGCGGAAGGC CACAGGGTCC 
ACTTGTTTAT CTGCTGACCC 
AATCCCCCTC TGTGAGAAAC 
AAAAAACTCG AGGGGG 



TTGCTTGGAA CATTATTAAT 
GGAAGATGTT CCGAACCCCA 
TTCAGAAGAT GCTTGAAGGC 
CTAATCTCAA GTACCCAGGG 
TCTGCATAGG AAAACCAGAG 
TCCCTCCACT ATTGTCCTAT 
ACCCAAGAAT GATCAATAAA 
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p„m*n OB Rprppfnr "n" Amino Acid sequence (Sequence ID No . 7) 

1 MICQKFCWL LHWEFI YVIT AFNLSYPITP WRFKLSCMPP NSTYDYFLLP 

51 AGLSKNTSNS NGHYETAVEP KFNSSGTHFS NLSKTTFHCC FRSEQDRNCS . 
101 LCADNIEGKT FVSTVNSLVF QQIDANWNIQ CWLKGDLKLF ICYVESLFKN 
151 LFRNYNYKVH LLYVLPEVLE DSPLVPQKGS FQMVHCNCSV HECCECLVPV 
201 PTAKLNDTLL MCLKITSGGV IFQSPLMSVQ PINMVKPDPP LGLHMEITDD 
251 GNLKISWSSP PLVPFPLQYQ VKYSENSTTV IREADKIVSA TSLLVDSILP 
301 GSSYEVQVRG KRLDGPGIWS DWSTPRVFTT QDVIYFPPKI LTSVGSNVSF 
351 HCIYKKENKI VPSKEIVWWM NLAEKIPQSQ YDWSDHVSK VTFFNLNETK 
401 PRGKFTYDAV YCCNEHECHH RYAELYVIDV NINISCETDG YLTKMTCRWS 
451 TSTIQSLAES TLQLRYHRSS LYCSDIPSIH PISEPKDCYL QSDGFYECIF 
501 QPIFLLSGYT MWIRINHSLG SLDSPPTCVL PDSWKPLPP SSVKAEITIN 
551 IGLLKISWEK PVFPENNLQF QIRYGLSGKE VQWKMYEVYD AKSKSVSLPV 
601 PDLCAVYAVQ VRCKRLDGLG YWSNWSNPAY TWMDIKVPM RGPEFWRI I N 
651 GDTMKKEKNV TLLWKPLMKN DSLCSVQRYV INHHTSCNGT WSEDVGNHTK 
701 FTFLWTEQAH TVTVLAINSI GASVANFNLT FSWPMSKVNI VQSLSAYPLN 
751 SSCVIVSWIL SPSDYKLMYF I IEWKNLNED GEIKWLRISS SVKKYYIHDH 
801 FIPIEKYQFS LYPIFMEGVG KPKIINSFTQ DDIEKHQSDA GLYVIVPVII 
851 SSSILLLGTL LI SHQRMKKL FWEDVPNPKN CSWAQGLNFQ KPETFEHLFI 
901 KHTASVTCGP LLLEPETISE DISVDTSWKN KDEMMPTTW SLLSTTDLEK 
951 GSVCISDQFN SVNFSEAEGT EVTYEDESQR QPFVKYATLI SNSKPSETGE 
1001 EQGLINSSVT KCFSSKNSPL KDSFSNSSWE IEAQAFFILS DQHPNIISPH 
1051 LTFSEGLDEL LKLEGNFPEE NNDKKSIYYL GVTSIKKRES GVLLTDKSRV 
1101 SCPFPAPCLF TDIRVLQDSC SHFVENNINL GTSSKKTFAS YMPQFQTCST 
1151 QTHKIMENKM CDLTV*FH*R NLQICVIMGN IKCNRL*LWV GERKETRVKF 
1201 ENNCSK*KKK KKNSRPARPD 
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Human OB R^Df or "D" Nucleic Arid SpmiPnce ( Sequence ID No . 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1 


GCGGCCGCCA 


GTGTGATGGA 


TATCTGCAGA 


ATTCGGCTTT 


CTCTGCCTTC 


51 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


CTTCTCTGAA 


GTAAGATGAT 


101 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


TTGGGAATTT 


ATTTATGTGA 


151 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


CTCCTTGGAG 


ATTTAAGTTG 


201 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


TACTTCCTTT 


TGCCTGCTGG 


251 


GCTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


ACATTATGAG 


ACAGCTGTTG 


301 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


TTTCTAACTT 


ATCCAAAACA 


351 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


GATAGAAACT 


GCTCCTTATG 


401 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


TTCAACAGTA 


AATTCTTTAG 


451 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


TACAGTGCTG 


GCTAAAAGGA 


501 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


TCATTATTTA 


AGAATCTATT 


551 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


ATATGTTCTG 


CCTGAAGTGT 


601 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


GCAGTTTTCA 


GATGGTTCAC 


651 


TGCAATTGCA 


GTGTTCACGA 


ATGTTGTGAA 


TGTCTTGTGC 


CTGTGCCAAC 


701 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


TTTGAAAATC 


ACATCTGGTG 


751 


GAGTAAT7TT 


CCAGTCACCT 


CTAATGTCAG 


TTCAGCCCAT 


AAATATGGTG 


801 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


GAAATCACAG 


ATGATGGTAA 


851 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


GGTACCATTT 


CCACTTCAAT 


901 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


CAGTTATCAG 


AGAAGCTGAC 


951 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 


GACAGTATAC 


TTCCTGGGTC 


1001 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


ACTGGATGGC 


CCAGGAATCT 


1051 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


CCACACAAGA 


TGTCATATAC 


1101 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


TCTAATGTTT 


CTTTTCACTG 


1151 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


.CTCAAAAGAG 


ATTGTTTGGT 


1201 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


GCCAGTATGA 


TGTTGTGAGT 


1251 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 


CTGAATGAAA 


CCAAACCTCG 
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1301 AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 
1351 AT CATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA 
1401 TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG 
1451 TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 
1501 GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC 
1551 AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC 
1601 AATCTTCCTA TTATCTGGCT ACACAATGTG GAT TAGGAT C AATCACTCTC 
1651 TAGGTTGACT TGACTCTCCA CCAACATGTG TCCTTCCTGA TTCTGTGGTG 
1701 AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAAT TACT A TAAACATTGG 
1751 ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 
1801 AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG 
1851 TATGAGGTTT ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA 
1901 CTTGTGTGCA GTCTATGCTG TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
1951 TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT TGTCATGGAT 
2001 ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA 
2051 TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 
2101 AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT 
2151 TCCTGCAATG GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC 
2201 TTTCCTGTGG ACAGAGCAAG CACATACTGT TACGGTTCTG GCCATCAATT 
2251 CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC ATGGCCTATG 
2301 AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG 
2351 TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 
2401 ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG 
2451 CTTAGAATCT CTTCATCTGT TAAGAAGTAT TATATCCATG ATCATTTTAT 
2501 CCCCATTGAG AAGTACCAGT TCAGTCTTTA CCCAATATTT ATGGAAGGAG 
2551 TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA TATTGAAAAA 
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2601 
2651 
5 2701 
2751 
2801 

10 

2851 
2901 
15 2951 
3001 
3051 

20 

3101 
3151 
25 3201 
3251 
3301 

30 

3351 
3401 
35 3451 
3501 
3551 



CACCAGAGTG 
TTCCATCTTA 
AGCTATTTTG 
GGACTTAATT 
TACAGCATCA 
CAGAAGATAT 
CCAACAACTG 
TGTTTGTATT 
GTACTGAGGT 
TACGCCACGC 
AGGGCTTATA 
CGTTGAAGGA 
TTTTTTATAT 
ATTCTCAGAA 
AAGAAAATAA 
AAAAAGAGAG 
CCCATTCCCA 
GTTGCTCACA 
AAGACTTTTG 
TCATAAGATC 



ATGCAGGTTT 
TTGCTTGGAA 
GGAAGATGTT 
TTCAGAAGCC 
GTGACATGTG 
CAGTGTTGAT 
TGGTCTCTCT 
AGTGACCAGT 
AACCTATGAG 
TGATCAGCAA 
AATAGTTCAG 
TTCTTTCTCT 
TATCGGATCA 
GGATTGGATG 
TGATAAAAAG 
AGAGTGGTGT 
GCCCCCTGTT 
CTTTGTAGAA 
CATCTTACAT 
ATGGAAAACA 
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ATATGTAATT 
CATTATTAAT 
CCGAACCCCA 
AGAAACGTTT 
GTCCTCTTCT 
ACATCATGGA 
ACTTTCAACA 
TCAACAGTGT 
GACGAAAGCC 
CTCTAAACCA 
TCACCAAGTG 
AATAGCTCAT 
GCATCCCAAC 
AACTTTTGAA 
TCTATCTATT 
GCTTTTGACT 
TATTCACGGA 
AATAATATCA 
GCCTCAATTC 
AGATGTGTGA 



GTGCCAGTAA 
ATCACACCAA 
AGAATTGTTC 
GAGCATCTTT 
TTTGGAGCCT 
AAAATAAAGA 
ACAGATCTTG 
TAACTTCTCT 
AGAGACAACC 
AGTGAAACTG 
CTTCTCTAGC 
GGGAGATAGA 
ATAATTTCAC 
ATTGGAGGGA 
ATTTAGGGGT 
GACAAGTCAA 
CATCAGAGTT 
ACTTAGGAAC 
CAAACTTGTT 
CCTAACTGTG 



TTATTTCCTC 
AGAATGAAAA 
CTGGGCACAA 
TTATCAAGCA 
GAAACAATTT 
TGAGATGATG 
AAAAGGGTTC 
GAGGCTGAGG 
CTTTGTTAAA 
GTGAAGAACA 
AAAAATTCTC 
GGCCCAGGCA 
CACACCTCAC 
AATTTCCCTG 
CACCTCAATC 
GGGTATCGTG 
CTCCAGGACA 
TTCTAGTAAG 
CTACTCAGAC 
TAATCTAGA 
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Human OB Rprpptor Protein "D" Chromosomal PNA (Seq. IP HO ■ 9) 



Intron 1 



.taccttttccag 



GTG TAC TTC 



CAT TOG G gtaagttatttg. 
His Trp Glu 
12 13 14 



Intron 2 



.atatcctaacag AA ' TTT ATT 

Pho lie 
15 16 



CAA ATA G gtaagcattagc. 
Gin lie Asp 
122 123 124 



Intron 3 



.ttttaaattcag 



AT GCA AAC 
Ala Aan 
125 126 



TAT GTT CT gtaagtaccaaa . 
Tyr Val Leu 
163 164 165 



Intron 4 



.txtrcaatatag 



G OCT GAA 
Pro Glu 
166 167 



AAT ATG G gtaagttatgca . 
Aan Met Val 
233 234 235 



Intron 5 



.tttttccttaag 



TG AAG OCT 
Lys Pro 
236 237 



ATC AGA GAA gtaagtatattt . 
lie Arg Glu 
281 282 283 



Intron 6 



.aatatttaacag 



GOT GAC AAG 
Ala Asp Lys 
284 285 286 



AGA CAA G gtaggttatgta . 
Thr Gin Asp 
330 331 332 



Intron 7 



. ccctcatracag 



AT GTC ATA 
Val He 

333 334 



GTG ATT G gtaagaaaacag . 
Val He Asp 
427 428 429 



Intron 8 



. tgtttcaaatag 



AT GTC AAT 
Val Asn 
430 431 



TAT CAT AG gtacgtattatt . 
Tyr His Arg 
466 467 468 



Intron 9 



.tatcttttaaag 



G AGC AGC 
Ser Ser 
469 470 



TCT GTC G gtatgtcaagct Intron 10 aaaaatttctag TG AAG CCA 



Ser Val Val 
533 534 535 



Lys Pro 

536 537 



CAA TGG AAG otaccttttact Intron 11 cttattttacag ATG TAT GAG 

Uof *TVfT» flln 



Gin Trp Lys 
582 583 584 



Met Tyr GXu 
585 586 587 



ATA AAA G gtctgcagagat 
He Lys Val 
636 637 638 



Intron 12 gtcattttgcag 



TT OCT ATG 
Pro Met 
639 640 



WO 97/25424 



PCT/US97/00128 



- 81 - 



CTT TGG AAG gtattcccaatt . 
Leu Trp Lya 
663 664 665 



Intron 13 



. t att tactacag 



CCC CTG ATG 
Pro Leu Met 
666 667 668 



AGC AAA G gtaagaagaggt . 
Ser Lya Val 
736 737 738 



Intron 14 



. ttttcccct cag 



TA AAT ATC 
Asn He 
739 740 



ATC CAT G gtaagtttacta . 
lie His Asp 
797 798 799 



Intron 15 



.ttttctcctcag 



AT CAT TTT 
His Phe 
800 801 



ACT CAA G grtaaaaatxata . 
Thr Gin Asp 
829 830 831 



Intron 16 



.tttctttttcag 



AT GAT ATT 
Asp lie 
832 833 



CAC CAA AG gtattgtacttg. 
His Gin Arg 
864 865 866 



Intron ,17 



. tatcctttgtag 



ATG AAA 
Mot Lys 
867 868 



TTT CAG AAG gttgetttttea . 
Phe Gin Lys 
889 890 891 



Intron 18 



.ttatctaaacag 



Exon A 
AGA ACG GAC 
Arg Thr Asp 
892 893 894 



Exon A 

AAA TAT GAT gtacatttgtct . 



Intron 18 



.cttttettttag 



Exon D 

CCA GAA ACG 

Pro Glu Thr 

892 893 894 



Exon B 

AAA OGT TTG 

Lys Arg Leu 

892 893 894 



Exon D 

GAA ACC AGA gtatccagtgtt , 



Intron 18 



. ctttttaaacag 



Exon C 

ATG CTT GAA 

Met Leu Glu 

892 893 894 
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^im^n or Por- B r,t n r P r nr»in. Rernmhi nant Serrated Receptor am i no 

rlfii ^ "p"" gnrp *S«n. Tr >- NQ - 1Q) : 



1 


MICQKFCWL 


LHWEFIYVIT 


AFNLSYPITP 


WRFKLSCMPP : 


NSTYDYFLLP 


51 


AGLSKNTSNS 


NGHYETAVEP 


KFNSSGTHFS 


NLSKTTFHCC 


FRSEQDRNCS 


101 


LCADNIEGKT 


FVSTVNSLVF 


QQIDANWNIQ 


CWLKGDLKLF 


ICYVESLFKN 


151 


LFRNYNYKVH 


LLYVLPEVLE 


DSPLVPQKGS 


FQMVHCNCSV 


HECCECLVPV 


201 


PTAKLNDTLL 


MCLKITSGGV 


IFQSPLMSVQ 


PINMVKPDPP 


LGLHMEITDD 


251 


GNLKISWSSP 


PLVPFPLQYQ 


VKYSENSTTV 


IREADKIVSA 


TSLLVDSILP 


301 


GSSYEVQVRG 


KRLDGPGIWS 


DWSTPRVFTT 


QDVIYFPPKI 


LTSVGSNVSF 


351 


HCIYKKENKI 


VPSKEIVWWM 


NLAEKIPQSQ 


YDWSDHVSK 


VTFFNLNETK 


401 


PRGKFTYDAV 


YCCNEHECHH 


RYAELYVIDV 


NINISCETDG 


YLTKMTCRWS 


451 


TSTIQSLAES 


TLQLRYHRSS 


LYCSDIPSIH 


PISEPKDCYL 


QSDGFYECIF 


501 


QPIFLLSGYT 


MWIRINHSLG 


SLDSPPTCVL 


PDSWKPLPP 


SSVKAEITIN 


551 


IGLLKISWEK 


PVFPENNLQF 


QIRYGLSGKE 


VQWKMYEVYD 


AKSKSVSLPV 


601 


PDLCAVYAVQ 


VRCKRLDGLG 


YWSNWSNPAY 


TWMDIKVPM 


RGPEFWRIIN 


651 


GDTMKKEKNV 


TLLWKPLMKN 


DSLCSVQRYV 


INHHTSCNGT 


WSEDVGNHTK 


701 


FTFLWTEQAH 


TVTVLAINSI 


GASVANFNLT 


FSWPMSKVNI 


VQSLSAYPLN 


751 


SSCVIVSWIL 


SPSDYKLMYF 


I IEWKNLNED 


GEIKWLRISS 


SVKKYYIHDH 


801 


FIPIEKYQFS 


LYPIFMEGVG 


; KPKIINSFTQ 


> DDIEKHQSD 
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Human OB Recepto r Protein. Recomhi nanl- Secreted Receptor DNA 
sequence fSeq. TP. No. 11): 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1 


GCGGCCGCCA 


GTGTGATGGA 


TATCTGCAGA 


ATTCGGCTTT 


CTCTGCCTTC 


51 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


CTTCTCTGAA 


GTAAGATGAT 


101 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


TTGGGAATTT 


ATTTATGTGA 


151 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


CTCCTTGGAG 


ATTTAAGTTG 


201 


• TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


TACTTCCTTT 


TGCCTGCTGG 


251 


GCTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


ACATTATGAG 


ACAGCTGTTG 


301 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


TTTCTAACTT 


ATCCAAAACA 


351 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 


GATAGAAACT 


GCTCCTTATG 


401 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


TTCAACAGTA 


AATTCTTTAG 


451 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


TACAGTGCTG 


GCTAAAAGGA 


501 


GACT TAAAAT 


TATTCATCTG 


TTATGTGGAG 


TCATTATTTA 


AGAATCTATT 


551 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


ATATGTTCTG 


CCTGAAGTGT 


601 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


GCAGTTTTCA 


GATGGTTCAC 


651 


TGCAATTGCA 


GTGTTCACGA 


ATGTTGTGAA 


TGTCTTGTGC 


CTGTGCCAAC 


701 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


TTTGAAAATC 


ACATCTGGTG 


751 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


TTCAGCCCAT 


AAATATGGTG 


801 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


GAAATCACAG 


ATGATGGTAA 


851 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


GGTACCATTT 


CCACTTCAAT 


901 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


CAGTTATCAG 


AGAAGCTGAC 


Z70 ± 












1001 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


ACTGGATGGC 


CCAGGAATCT 


1051 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


CCACACAAGA 


TGTCATATAC 


1101 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


TCTAATGTTT 


CTTTTCACTG 


1151 


CATC T ATAAG 


AAGGAAAACA 


AGATTGTTCC 


CTCAAAAGAG 


ATTGTTTGGT 


1201 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


GCCAGTATGA 


TGTTGTGAGT 
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1251 GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG 
1301 AGGAAAGTTT ACCTAT GATG CAGTGTACTG CTGCAATGAA CATGAATGCC 
1351 ATCATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA 
1401 TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG 
1451 TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 
1501 GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC 
1551 AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC 
1601 AATCTTCCTA TTATCTGGCT ACACAATGTG GATTAGGATC AATCACTCTC 
1651 TAGGTTCACT TGACTCTCCA CCAACATGTG TCCTTCCTGA TTCTGTGGTG 
1701 AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA TAAACATTGG 
1751 ATT AT TGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 
1801 AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG 
1851 TATGAGGTTT ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA 
1901 CTTGTGTGCA GTCTATGCTG TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
1951 TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT TGTCATGGAT 
2001 ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA 
2051 TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 
2101 AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT 
2151 TCCTGCAATG GAACATGGTC AGAAGAT GTG GGAAATCACA CGAAATTCAC 
2201 TTTCCTGTGG ACAGAGCAAG CACATACTGT TACGGTTCTG GCCATCAATT 
2251 CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC ATGGCCTATG 
2301 AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG 
2351 TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 
2401 ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG 
2451 CTTAGAATCT CTTCATCTGT TAAGAAGTAT TATATCCATG ATCATTTTAT 
2501 CCCCATTGAG AAGTACCAGT TCAGTCTTTA CCCAATATTT ATGGAAGGAG 



WO 97/25424 



PCI7US97/00128 



- 85 - 

2551 TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA TATTGAAAAA 
2601 CACCAGAGTG ATTGATAAGG ATCC 
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p ima n n R R P ^ r ^r pmr^n. Recombinant Sfifi^fd w^Pto*- PH . * . 

COT iPnrP t^i-h r-tfrminal FT.AG (Sea. TP NO. 1 2 ) : 
1 CCATTGAAGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGGATTT 

51 CCAAAATGTC GTAATAACCC CGCCCCGTTG ACGCAAATGG GCGGTAGGCG 
101 TGTACGGTGG GAGGTCTATA TAAGCAGAGC TCGTTTAGTG AACCGTCAGA 
151 TCTCTAGAAG CTGGGTACCA GCTGCTAGCA AGCTTGCTAG CGGCCGCCAG 
201 TGTGATGGAT ATCTGCAGAA TTCGGCTTTC TCTGCCTTCG GTCGAGTTGG 
251 ACCCCCGGAT CAAGGTGTAC TTCTCTGAAG TAAGATGATT TGTCAAAAAT 
301 TCTGTGTGGT TTTGTTACAT TGGGAATTTA TTTATGTGAT AACTGCGTTT 
351 AACTTGTCAT ATCCAAT TAC TCCTTGGAGA TTTAAGTTGT CTTGCATGCC 
401 ACCAAATTCA ACCTATGACT ACTTCCTTTT GCCTGCTGGG CTCTCAAAGA 
451 ATACTTCAAA TTCGAAT GGA CATTATGAGA CAGCTGTTGA ACCTAAGTTT 
501 AATTCAAGTG GTACTCACTT TTCTAACTTA TCCAAAACAA CTTTCCACTG 
551 TTGCTTTCGG AGTGAGCAAG ATAGAAACTG CTCCTTATGT GCAGACAACA 
601 TTGAAGGAAA GACATTTGTT TCAACAGTAA ATTCTTTAGT TTTTCAACAA 
651 ATAGATGCAA ACTGGAACAT ACAGTGCTGG CTAAAAGGAG ACTTAAAATT 
701 ATTCATCTGT TATGTGGAGT CATTATTTAA GAATCTATTC AGGAATTATA 
751 ACTATAAGGT CCATCTTTTA TATGTTCTGC CTGAAGTGTT AGAAGATTCA 
801 CCTCTGGTTC CCCAAAAAGG CAGTTTTCAG ATGGTTCACT GCAAT TGCAG 
851 TGTTCACGAA TGTTGTGAAT GTCTTGTGCC TGTGCCAACA GCCAAACTCA 
901 ACGACACTCT CCTTATGTGT TTGAAAATCA CATCTGGTGG AGTAATTTTC 
951 CAGTCACCTC TAATGTCAGT TCAGCCCATA AATATGGTGA AGCCTGATCC 
1001 ACCATTAGGT TTGCATATGG AAATCACAGA TGATGGTAAT TTAAAGATTT 
1051 CTTGGTCCAG CCCACCATTG GTACCATTTC CACTTCAATA TCAAGTGAAA 
1101 TATTCAGAGA ATTCTACAAC AGT TATCAGA GAAGCTGACA AGATTGTCTC 
1151 AGCTACATCC CTGCTAGTAG ACAGTATACT TCCTGGGTCT TCGTATGAGG 
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1201 


TTCAGGTGAG 


GGGCAAGAGA 


CTGGATGGCC 


CAGGAATCTG 


GAGTGACTGG 




1251 


AGTACTCCTC 


GTGTCTTTAC 


CACACAAGAT 


GTCATATACT 


TTCCACCTAA 


5 


1301 


AATTCTGACA 


AGTGTTGGGT 


CTAATGTTTC 


TTTTCACTGC 


ATCTATAAGA 




1351 


AGGAAAACAA 


GATTGTTCCC 


TCAAAAGAGA 


TTGTTTGGTG 


GATGAATTTA 


10 


1401 


GCTGAGAAAA 


TTCCTCAAAG 


CCAGTATGAT 


GTTGTGAGTG 


ATCATGTTAG 


1451 


CAAAGTTACT 


TTTTTCAATC 


TGAATGAAAC 


CAAACCTCGA 


GGAAAGTTTA 




1501 


CCTATGATGC 


AGTGTACTGC 


TGCAATGAAC 


ATGAATGCCA 


TCATCGCTAT 


15 


1551 


GCTGAATTAT 


ATGTGATTGA 


TGTCAATATC 


AATATCTCAT 


GTGAAACTGA 




1601 


TGGGTACTTA 


ACTAAAATGA 


CTTGCAGATG 


GTCAAGCAGT 


ACAATCCAGT 


20 


1651 


CACTTGCGGA 


AAGCACTTTG 


CAATTGAGGT 


ATCATAGGAG 


CAGCCTTTAC 


1701 


TGTTCTGATA 


TTCCATCTAT 


TCATCCCATA 


TCTGAGCCCA 


AAGATTGCTA 




1751 


TTTGCAGAGT 


GATGGTTTTT 


ATGAATGCAT 


TTTCCAGCCA 


ATCTTCCTAT 


25 


1801 


TATCTGGCTA 


CACAATGTGG 


ATTAGGATCA 


ATCACTCTCT 


AGGTTCACTT 




1851 


GACTCTCCAC 


CAACATGTGT 


CCTTCCTGAT 


TCTGTGGTGA 


AGCCACTGCC 


30 


1901 


TCCATCCAGT 


GTGAAAGCAG 


AAATTACTAT 


AAACATTGGA 


TTATTGAAAA 


1951 


TATCTTGGGA 


AAAGCCAGTC 


TTTCCAGAGA 


ATAACCTTCA 


ATTCCAGATT 




2001 


CGCTATGGTT 


TAAGTGGAAA 


AGAAGTACAA 


TGGAAGATGT 


ATGAGGTTTA 


35 


2051 


TGATGCAAAA 


TCAAAATCTG 


TCAGTCTCCC 


AGTTCCAGAC 


TTGTGTGCAG 




2101 


TCTATGCTGT 


TCAGGTGCGC 


TGTAAGAGGC 


TAG ATGGAC T 


GGGATATTGG 


40 


2151 


AGTAATTGGA 


GCAATCCAGC 


CTACACAGTT 


GTCATGGATA 


TAAAAGTTCC 


2201 


TATGAGAGGA 


CCTGAATTTT 


GGAGAATAAT 


TAATGGAGAT 


ACTATGAAAA 




2251 


AGGAGAAAAA 


TGTCACTTTA 


CTTTGGAAGC 


CCCTGATGAA 


AAATGACTCA 


45 


2301 


TTGTGCAGTG 


TTCAGAGATA 


TGTGATAAAC 


CATCATACTT 


CCTGCAATGG 




2351 


AACATGGTCA 


GAAGATGTGG 


GAAATCACAC 


GAAATTCACT 


TTCCTGTGGA 


DO 


2401 


CAGAGCAAGC 


ACATACTGTT 


ACGGTTCTGG 


CCATCAATTC 


AATTGGTGCT 


2451 


TCTGTTGCAA 


ATTTTAATTT 


AACCTTTTCA 


TGGCCTATGA 


GCAAAGTAAA 




2501 


TATCGTGCAG 


TCACTCAGTG 


CTTATCCTTT 


AAACAGCAGT 


TGTGTGATTG 
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2551 TTTCCTGGAT ACTATCACCC AGTGATTACA AGCTAATGTA TTTTATTATT 

2601 GAGTGGAAAA ATCTTAATGA AGATGGT GAA ATAAAATGGC TTAGAATCTC 

2651 TTCATCTGTT AAGAAGTATT ATATCCATGA TCATTTTATC CCCATTGAGA 

2701 AGTACCAGTT CAGTCTTTAC CCAATATTTA TGGAAGGAGT GGGAAAACCA 

2751 AAGATAATTA AT AGTT TCAC TCAAGATGAT ATTGAAAAAC ACCAGAGTGA 

2801 TGCAGGTGAC TACAAGGACG ACGATGACAA GTAGGGATCC AGACATGATA 

2851 AGATACATTG ATGAGTTTGG ACAACCCACA ACTAGAATGC AGTGAAAAAA 

2901 ATGCTTTATT TGTGAAATTT GTGATGCTAT TGCTTTATTT GTAACCAT 
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Recombinant Human OB Receptor Protein. Natural Splice Vari* n f 
amino ar?ri sequence (Sea. TP, No. 13) 



5 


1 


MICQKFCWL 


LHWEFIYVIT 


AFNLSYPITP 


WRFKLSCMPP 


NSTYDYFLLP 




51 


AGLSKNTSNS 


NGHYETAVEP 


KFNSSGTHFS 


NLSKTTFHCC 


FRSEQDRNCS 


10 


101 


LCADNIEGKT 


FVSTVNSLVF 


QQIDANWNIQ 


CWLKGDLKLF 


ICYVESLFKN 


151 


LFRNYNYKVH 


LLYVLPEVLE 


DSPLVPQKGS 


FQMVHCNCSV 


HECCECLVPV 




201 


PTAKLNDTLL 


MCLKITSGGV 


IFQSPLMSVQ 


PINMVKPDPP 


LGLHMEITDD 


15 


251 


GNLKISWSSP 


PLVPFPLQYQ 


VKYSENSTTV 


IREADKIVSA 


TSLLVDSILP 




301 


GSSYEVQVRG 


KRLDGPGIWS 


DWSTPRVFTT 


QDVIYFPPKI 


LTSVGSNVSF 


20 


351 


HCIYKKENKI 


VPSKEIVWWM 


NLAEKIPQSQ 


YDWSDHVSK 


VTFFNLNETK 


401 


PRGKFTYDAV 


YCCNEHECHH 


RYAELYVIDV 


NINISCETDG 


YLTKMTCRWS 




451 


TSTIQSLAES 


TLQLRYHRSS 


LYCSDIPSIH 


PISEPKDCYL 


QSDGFYECIF 


25 


501 


QPIFLLSGYT 


MWIRINHSLG 


SLDSPPTCVL 


PDSWKPLPP 


SSVKAEITIN 




551 


IGLLKISWEK 


PVFPENNLQF 


OIRYGLSGKE 


VOWKMYEVYD 


AKSKSVSLPV 


30 


601 


PDLCAVYAVQ 


VRCKRLDGLG 


YWSNWSNPAY 


TWMDIKVPM 


RGPEFWRIIN 


651 


GDTMKKEKNV 


TLLWKPLMKN 


DSLCSVQRYV 


INHHTSCNGT 


WSEDVGNHTK 




.701 


FTFLWTEQAH 


TVTVLAINSI 


GASVANFNLT 


FSWPMSKVNI 


VQSLSAYPLN 


35 


751 
801 


SSCVIVSWIL 
FTIL 


SPSDYKLMYF 


IIEWKNLNED 


GEIKWLRISS 


SVKKYYIHGK 
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Human OB Rprpotnr Prof.ein. Natural SnU CP Vflriflnt , DNft (Sfifl . If> 

NO. 14) 

1 GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC 

51 GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT 
101 TTGTCAAAAA TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 
151 TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG 
201 TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG 
251 GCTCTCAAAG AATACT TCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 
301 AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA 
351 ACTTTCCACT GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG 
401 TGCAGACAAG ATTGAAGGAA AGACATTTGT TTCAACAGTA AATTCTTTAG 
451 TTTTTCAACA AATAGATGCA AACTGGAACA TACAGTGCTG GCTAAAAGGA 
501 GACT TAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA AGAATCTATT 
551 CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 
601 TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC 
651 TGCAATTGCA GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC 
701 AGCCAAACTC AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 
751 GAGTAATTTT CCAGTCACCT CTAATGTCAG TTCAGCCCAT AAATATGGTG 
801 AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG ATGATGGTAA 
851 TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 
901 ATCAAGTGAA ATATTCAGAG AATTCTACAA CAGTTATCAG AGAAGCTGAC 
951 AAGATTGTCT CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC 
1001 TTCGTATGAG GTTCAGGTGA GGGGCAAGAG ACTGGATGGC CCAGGAATCT 
1051 GGAGTGACTG GAGTACTCCT CGTGTCTTTA CCACACAAGA TGTCATATAC 
1101 TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCTAATGTTT CTTTTCACTG 
1151 CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 
1201 GGATGAATTT AGCTGAGAAA AT TCC T C AAA GCCAGTATGA TGTTGTGAGT 
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1251 
1301 
5 1351 
1401 
1451 

10 

1501 
1551 
15 1601 
1651 
1701 

20 

1751 
1801 
25 1851 
1901 
1951 

30 

2001 
2051 
35 2101 
2151 
2201 

40 

2251 
2301 
45 2351 
2401 
2451 

50 

2501 



GATCATGTTA 
AGGAAAGTTT 
ATCATCGCTA 
TGTGAAACTG 
TACAATCCAG 
GCAGCCTTTA 
AAAGATTGCT 
AATCTTCCTA 
TAGGTTCACT 
AAGCCACTGC 
ATTATTGAAA 
AATTCCAGAT 
TATGAGGTTT 
CTTGTGTGCA 
TGGGATATTG 
ATAAAAGTTC 
TACTATGAAA 
AAAATGACTC 
TCCTGCAATG 
TTTCCTGTGG 
CAATTGGTGC 
AGCAAAGTAA 
TTGTGTGATT 
ATTTTATTAT 
CTTAGAATCT 
TATACTT 



GCAAAGTTAC 
ACCTATGATG 
TGCTGAATTA 
ATGGGTACTT 
TCACTTGCGG 
CTGTTCTGAT 
ATTTGCAGAG 
TTATCTGGCT 
TGACTCTCCA 
CTCCATCCAG 
ATATCTTGGG 
TCGCTATGGT 
ATGATGCAAA 
GTCTATGCTG 
GAGTAATTGG 
CTATGAGAGG 
AAGGAGAAAA 
ATTGTGCAGT 
GAACATGGTC 
ACAGAGCAAG 
TTCTGTTGCA 
ATATCGTGCA 
GTTTCCTGGA 
TGAGTGGAAA 
CTTCATCTGT 
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TTTTTTCAAT 
CAGTGTACTG 
TATGTGATTG 
AACTAAAATG 
AAAGCACTTT 
ATTCCATCTA 
TGATGGTTTT 
ACACAATGTG 
CCAACATGTG 
TGTGAAAGCA 
AAAAGCCAGT 
TTAAGTGGAA 
ATCAAAATCT 
TTCAGGTGCG 
AGCAATCCAG 
ACCTGAATTT 
ATGTCACTTT 
GTTCAGAGAT 
AGAAGATGTG 
CACATACTGT 
AATTTTAATT 
GTCACTCAGT 
TACTATCACC 
AATCTTAATG 
TAAGAAGTAT 



CTGAATGAAA 
CTGCAATGAA 
ATGTCAATAT 
ACTTGCAGAT 
GCAATTGAGG 
TTCATCCCAT 
TATGAATGCA 
GATTAGGATC 
TCCTTCCTGA 
GAAATTACTA 
CTTTCCAGAG 
AAGAAGTACA 
GTCAGTCTCC 
CTGTAAGAGG 
CCTACACAGT 
TGGAGAATAA 
ACTTTGGAAG 
ATGTGATAAA 
GGAAATCACA 
TACGGTTCTG 
TAACCTTTTC 
GCTTATCCTT 
CAGTGATTAC 
AAGATGGTGA 
TATATCCATG 



CCAAACCTCG 
CATGAATGCC 
CAATATCTCA 
GGTCAACCAG 
TATCATAGGA 
ATCTGAGCCC 
TTTTCCAGCC 
AATCACTCTC 
TTCTGTGGTG 
TAAACATTGG 
AATAACCTTC 
ATGGAAGATG 
CAGTTCCAGA 
CTAGATGGAC 
TGTCATGGAT 
TTAATGGAGA 
CCCCTGATGA 
CC AT CAT AC T 
CGAAATTCAC 
GCCATCAATT 
ATGGCCTATG 
TAAACAGCAG 
AAGCTAATGT 
AATAAAATGG 
GTAAGTTTAC 
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While the present invention has been described 
in terms of preferred embodiments, it is understood that 
variations and modifications will occur to those skilled 
in the art. Therefore, it is intended that the appended 
claims cover all such equivalent variations which come 
within the scope of the invention as claimed. 
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15 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: CHANG, MING-SHI 

WELCHER, ANDREW A. 
FLETCHER, FREDERICK A. 

<ii) TITLE OF INVENTION: OB PROTEIN RECEPTOR AND RELATED 
COMPOSITIONS AND METHODS 

(iii) NUMBER OF SEQUENCES: 33 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Antgen Inc. 

(B) STREET: 1840 Dehavilland Drive 

(C) CITY: Thousand Oaks 
20 (D) STATE: California 

(2) COUNTRY: USA 
(F) ZIP: 91320 

<v) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Floppy disk 

(3) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 
(3) FILING DATE: 
(C) CLASSIFICATION: 

35 (viii) ATTORNEY / AGENT INFORMATION: 

<A) NAME: Pessin, Karol M. 
<C) REFERENCE / DOCKET NUMBER: A— 3 82— A 



40 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 965 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
1 5 10 15 

Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro lie Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
SO 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 30 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin lie Asp Ala Asn Trp Asn 
115 120 125 

lie Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe lie Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 "5 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210. 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 
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Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 
325 330 335 



Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 

15 340 345 350 

He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys ' Glu He Val Trp 

355 360 365 

20 Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 

370 375 380 



Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 



Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
30 420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

35 Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 

450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 
465 470 475 480 



Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 



Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
45 500 505 510 

lie Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

50 Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 

530 535 540 
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Ala Glu He Thr He Asn He Gly Leu Leu Ly3 He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg lie He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 7B0 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 030 
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He Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 

835 840 845 

He lie Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 

850 855 860 . 

Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 

865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Arg Thr Asp He Leu 
885 890 895 



Ser Leu He Met He Thr Thr Asp Glu Pro Asn Val Pro Thr Ser Gin 
15 900 905 910 

Gin Ser He Glu Tyr Lys He Phe Thr Phe Arg Arg Gly Ala Asn Leu 
915 920 925 

20 Lys Lys He Gin Leu Asn Phe Glu Leu Thr Tyr Gly Gly Leu Cys Phe 

930 935 940 

Arg Thr Asn Arg Cys Val Asn Leu Gly Ser Lys Cys Arg Phe Glu Ser 
945 950 955 960 



Ser Leu Asp Val Leu 
965 

<2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
CCGCCGCCAT CTCTGCCTTC GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA 60 

45 

GTAAGATGAT TTGTCAAAAA TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 120 
TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG TCTTGCATGC 180 
50 CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG ACTCTCAAAG AATACTTCAA 240 
ATTCGAATGG ACATTATGAG ACAGCTGTTG AACCTAAGTT TAATTCAAGT GGTACTCACT 300 
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TTTCTAACTT ATCCAAAACA ACTTTCCACT GTTGCTTTCG GAGTGAGCAA GATAGAAACT 360 

GCTCCTTATG TGCAGACAAC AT T G AAGGAA AGACATTTGT TTCAACAGTA AATTCTTTAG 420 

TTTTTCAACA AATAGATGCA AACTGGAACA TACAGTGCTG GCTAAAAGGA GACTTAAAAT 480 

TATTCATCTG TTATGTGGAG TCATTATTTA AGAATCTATT CAGGAATTAT AACTATAAGG 540 

TCCATCTTTT ATATGTTCTG CCTGAAGTGT TAGAAGATTC ACCTCTGGTT CCCCAAAAAG 600 

GCAGTTTTCA GATGGTTCAC TGCAATTGCA GTGTTCATGA ATGTTGTGAA TGTCTTGTGC 660 

CTGTGCCAAC AGCCAAACTC AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 720 

GAGTAATTTT CCAGTCACCT CTAATGTCAG TTCAGCCCAT AAATATGGTG AAGCCTGATC 780 

CACCATTAGG TTTGCATATG GAAATCACAG ATGATGGTAA TTTAAAGATT TCTTGGTCCA 840 

GCCCACCATT GGTACCATTT CCACTTCAAT ATCAAGTGAA ATATTCAGAG AATTCTACAA 900 

CAGTTATCAG AGAAGCTGAC AAGATTGTCT CAGCTACATC CCTGCTAGTA GACAGTATAC 960 

TTCCTGGGTC TTCGTATGAG GTTCAGGTGA GGGGCAAGAG ACTGGATGGC CCAGGAATCT 1020 

GGAGTGACTG GAGTACTCCT CGTGTCTTTA CCACACAAGA TGTCATATAC TTTCCACCTA 1080 

AAATTCTGAC AAGTGTTGGG TCTAATGTTT CTTTTCACTG CATCTATAAG AAGGAAAACA 1140 

AGATTGTTCC CTCAAAAGAG ATTGTTTGGT GGATGAATTT AGCTGAGAAA ATTCCTCAAA 1200 

GCCAGTATGA TGTTGTGAGT GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 1260 

CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 1320 

ATCATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA TGTGAAACTG 1380 

ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG 1440 

AAAGC AC TTT GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 1500 

TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA 1560 

TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG GATTAGGATC AATCACTCTC 1620 

TAGGTTCACT TGACTCTCCA CCAACATGTG TCCTTCCTGA TTCTGTGGTG AAGCCACTGC 1680 

CTCCATCCAG TGTGAAAGCA GAAATTACTA TAAACATTGG ATTATTGAAA AT ATCT TGGG 1740 

AAAAGCCAGT CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA 1800 
AAGAAGT AC A ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT GTCAGTCTCC 1860 
CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 1920 
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TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT TGTCATGGAT ATAAAAGTTC 1980 

CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA 2040 

5 

ATGTCACTTT ACTTTGGAAG CCCCTGATGA AAAATGACTC ATTGTGCAGT GTTCAGAGAT 2100 

ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG GGAAATCACA 2160 

10 CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT TACGGTTCTG GCCATCAATT 2220 

CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC ATGGCCTATG AGCAAAGTAA 2280 

ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA 2340 

15 

TACTATCACC CAG TG AT T AC AAGCTAATGT ATTTTATTAT TGAG7GGAAA AATCTTAATG 2400 

AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT TATATCCATG 2460 

20 ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA CCCAATATTT ATGGAAGGAG 2520 

TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA TATTGAAAAA CACCAGAGTG 2580 

ATGCAGGTTT ATATGTAATT GTGCCAGTAA . TTATTTCCTC TTCCATCTTA TTGCTTGGAA 2640 

25 

CATTATTAAT ATCACACCAA AGAATGAAAA AGCTATTTTG GGAAGATGTT CCGAACCCCA 2700 

AGAATTGTTC CTGGGCACAA GGACTTAATT TTCAGAAGAG AACGGACATT CTTTGAAGTC 2760 

30 TAATCATGAT CACTACAGAT GAACCCAATG TGCCAACTTC CCAACAGTCT ATAGAGTATT 2820 

AGAAGATTTT TACATTTTGA AGAAGGGGAG CAAATCTAAA AAAAATTCAG TTGAACTTCT 2880 

GAGAGTTAAC ATATGGTGGA TTATGTTGAT TTAGAAC7TA AAATAGATGT GTAAATTTGG 2 940 

GTTCAAAATG TAGATTTGAG TCCAGTTTGG ATGTGTGATT AATTTTCAAA TCATCTAAAG 3000 

TTTAAAAGTA GTATTCATGA TTTCTGGCTT TTGATTTGCC ATATTCCTGG TCATAAAACA 3060 

4 0 TTAAGAAAAT TATGGCTGTT GCTGTCATTA CATATCTATT AAATGTCATC AAATATGTAG 3120 

TAGACAATTT TGTAATTAGG TGAACTCTAA AACTGCAACA TCTGACAAAT TGCTTTAAAA 3180 

ATACAATGAT TAT 3193 
<2) INFORMATION FOR SEQ ID NO: 3: 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 amino acids 
50 <B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inea r 
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(ii) MOLECULE TYPE: protein 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
15 10 15 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 
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Leu Gly Leu His Met Glu lie Thr Asp Asp Gly Asn Leu Lys lie Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 
275 280 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly lie Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val lie Tyr Phe 
325 330 335 

Pro Pro Lys lie Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

lie Tyr Lys Lys Glu Asn Lys lie Val Pro Ser Lys Glu lie Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys lie Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn lie 
420 425 430 

Asn lie Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr lie Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser lie His 
465 470 475 480 

Pro lie Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

Glu Cys lie Phe Gin Pro lie Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

lie Arg lie Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 
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Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu lie Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 
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Phe lie Pro lie Glu Lys Tyr Gin Phe Ser Leu Tyr Pr lie Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys lie lie Asn Ser Phe Thr Gin Asp Asp 
5 820 825 830 

lie Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val lie Val Pro Val 
835 840 845 

10 lie lie Ser Ser Ser lie Leu Leu Leu Gly Thr Leu Leu lie Ser His 

850 855 860 



15 



30 



Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 

865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Lys Arg Leu Ser lie 

885 890 895 



Phe Leu Ser Ser lie Gin His Gin His Val Val Leu Phe Phe Trp Ser 
20 900 905 910 

Leu Lys Gin Phe Gin Lys lie Ser Val Leu lie His His Gly Lys lie 
915 920 925 

25 Lys Met Arg Cys Gin Gin Leu Trp Ser Leu Tyr Phe Gin Gin Gin He 

930 935 940 



Leu Lys Arg Val Leu Phe Val Leu Val Thr Ser Ser Thr Val Leu Thr 

945 950 955 960 

Ser Leu Arg Leu Arg Val Leu Arg Pro Met Arg Thr Lys Ala Arg Asp 

965 970 975 



Asn Pro Leu Leu Asn Thr Pro Arg Ser Ala Thr Leu Asn Gin Val Lys 
35 980 985 990 

Leu Val Lys 
995 

40 (2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3063 base pairs 
<B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCGCCGCCAT CTCTGCCTTC GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA 
GTAAGATGAT TTGTCAAAAA TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 
TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG TCTTGCATGC 
CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG ACTCTCAAAG AATACTTCAA 
ATTCGAATGG ACATTATGAG ACAGCTGTTG AACCTAAGTT TAATTCAAGT GGT ACTCACT 
TTTCTAACTT ATCCAAAACA ACTTTCCACT GTTGCTTTCG GAGTGAGCAA GATAGAAACT 
GCTCCTTATG TGCAGACAAC ATTGAAGGAA AGACATTTGT TTCAACAGTA AAT TCTTT AG 
TTTTTCAACA AATAGATGCA AACTGGAACA TACAGTGCTG GCTAAAAGGA GACTTAAAAT 
TATTCATCTG TTATGTGGAG TCATTATTTA AGAATCTATT CAGGAATTAT AACTATAAGG 
TCCATCTTTT ATATGTTCTG CCTGAAGTGT TAGAAGATTC ACCTCTGGTT CCCCAAAAAG 
GCAGTTTTCA GATGGTTCAC TGCAATTGCA GTGTTCATGA ATGTTGTGAA TGTCTTGTGC 
CTGTGCCAAC AGCCAAACT C AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 
GAGTAATTTT CCAGTCACCT CTAATGTCAG TTCAGCCCAT AAATATGGTG AAGCCTGATC 
CACCATTAGG TTTGCATATG GAAATCACAG ATGATGGTAA TTTAAAGATT TCTTGGTCCA 
GCCCACCATT GGTACCATTT CCACTTCAAT ATCAAGTGAA ATATTCAGAG AATTCTACAA 
CAGTTATCAG AGAAGCTGAC AAGATTGTCT CAGCTACATC CCTGCTAGTA GACAGTATAC 
TTCCTGGGTC TTCGTATGAG GTTCAGGTGA GGGGCAAGAG ACTGGATGGC CCAGGAATCT 
GGAGTGACTG GAGTACTCCT CGTGTCTTTA CCACACAAGA TGTCATATAC TTTCCACCTA 
AAATTCTGAC AAGTGTTGGG TCTAATGTTT CTTTTCACTG CATCTATAAG AAGGAAAACA 
AGATTGTTCC CTCAAAAGAG ATTGTTTGGT GGATGAATTT AGCTGAGAAA ATTCCTCAAA 
GCCAGTATGA TGTTGTGAGT GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 
CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 
ATC ATCGCT A TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA TGTGAAACTG 
ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG 
AAAGCACTTT GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA 



60 
120 

180 

240 

300 - 

360 

420 

480 

540 

600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 
1320 
1380 
1440 
1500 
1560 
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TTTTCCAGCC AATCTTCCTA TTATCTGGCT 
TAGGTTCACT TGACTCTCCA CCAACATGTG 

5 

CTCCATCCAG TGTGAAAGCA GAAATTACTA 
AAAAGCCAGT CTTTCCAGAG AATAACCTTC 
10 AAGAAGTACA ATGGAAGATG TATGAGGTTT 
CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 
TGGGATATTG GAGTAATTGG AGCAATCCAG 

15 

CTATGAGAGG ACCTGAATTT TGGAGAATAA 
ATGTCACTTT ACTTTGGAAG CCCCTGATGA 
20 ATGTGATAAA CCATCATACT TCCTGCAATG 
CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 
CAATTGGTGC TTCTGTTGCA AATTTTAATT 

25 

ATATCGTGCA GTCACTCAGT GCTTATCCTT 
TACTATCACC C AG TG ATT AC AAGCTAATGT 
30 AAGATGGTGA AATAAAATGG CTTAGAATCT 
ATCATTTTAT CCCCATTGAG AAGTACCAGT 
TGGGAAAACC AAAGATAATT AATAGTTTCA 

35 

ATGCAGGTTT ATATGTAATT GTGCCAGTAA 
CATTATTAAT ATCACACCAA AGAATGAAAA 
40 AGAATTGTTC CTGGGCACAA GGACTTAATT 
CAAGCATACA GCATCAGTGA CATGTGGTCC 
AGATATCAGT GTTGATACAT CATGGAAAAA 

45 

CTCTCTACTT TCAACAACAG ATCTTGAAAA 
CAGTGTTAAC TTCTCTGAGG CTGAGGGTAC 
50 ACAACCCTTT GTTAAATACG CCACGCTGAT 
AGA 



105 - 

ACACAATGTG GATTAGGATC AATCACTCTC 1620 

TCCTTCCTGA TTCTGTGGTG AAGCCACTGC 1680 

TAAACATTGG ATTATTGAAA ATATCTTGGG 1740 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA 1800 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC 1860 

TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 1920 

CCTACACAGT TGTCATGGAT ATAAAAGTTC 1980 

TTAATGGAGA TACTATGAAA AAGGAGAAAA 2040 

AAAATGACTC ATTGTGCAGT GTTCAGAGAT 2100 

GAACATGGTC AGAAGATGTG GGAAATCACA 2160 

CACATACTGT TACGGTTCTG GCCATCAATT 2220 

TAACCTTTTC ATGGCCTATG AGCAAAGTAA 2280 

TAAACAGCAG TTGTGTGATT GTTTCCTGGA 2340 

ATTTTATTAT TGAGTGGAAA AATCTTAATG 2400 

CTTCATCTGT T AAGAAGT AT TATATCCATG 2460 

TCAGTCTTTA CCCAATATTT ATGGAAGGAG 2520 

CTCAAGATGA TATTGAAAAA CACCAGAGTG 2580 

TTATTTCCTC TTCCATCTTA TTGCTTGGAA 2640 

AGCTATTTTG GGAAGATGTT CCGAACCCCA 2700 

TTCAGAAGAA ACGTTTGAGC ATCTTTTTAT 2760 

TCTTCTTTTG GAGCCTGAAA CAATTTCAGA 2820 

TAAAGATGAG ATGATGCCAA CAACTGTGGT 2880 

GGGTTCTGTT TGTTTTAGTG ACCAGTTCAA 2940 

TGAGGTAACC TATGAGGACG AAAGCCAGAG 3000 

CAGCAACTCT AAACCAAGTG AAACTGGTGA 3060 

3063 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
x 5 10 15 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 BO 

Asn Leu Ser Lys Thr Thr Phe ^His ^,Cy3 ^fys Phe Arg Ser Glu Gin Asp 
B5 - 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lvs Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 



WO 97/25424 



PCT/US97/00128 



- 107 - 



Leu Leu Met Cys 
210 

5 Pro Leu Met Ser 

225 

Leu Gly Leu His 

10 

Trp Ser Ser Pro 
260 

Tyr Ser Glu Asn 
15 275 

Ser Ala Thr Ser 
290 

20 Glu Val Gin Val 

305 



Leu Lys lie Thr Ser Gly 
215 

Val Gin Pro lie Asn Met 
230 

Met Glu lie Thr Asp Asp 
245 250 

Pro Leu Val Pro Phe Pro 
265 

Ser Thr Thr Val lie Arg 
280 

Leu Leu Val Asp Ser lie 
295 

Arg Gly Lys Arg Leu Asp 

-310 



Gly val lie Phe Gin Ser 
220 

Val Lys Pro Asp Pro Pro 
235 240 

Gly Asn Leu Lys lie Ser 
255 

Leu Gin Tyr Gin Val Lys 
270 

Glu Ala Asp Lys lie Val 
285 

Leu Pro Gly Ser Ser Tyr 
300 

Gly Pro Gly lie Trp Ser 
315 320 



25 



Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 
325 330 335 

Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 



He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp, 
30 355 360 365 

Trp Met Asn Leu Ala Glu Lys lie Pro Gin -Ser -Gin Tyr Asp Val Val 
370 375 380 

35 Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 

385 390 395 400 



40 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 



Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 

45 435 440 445 

Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 



50 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser lie His 
465 470 475 480 
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Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu He Thr lie. Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 
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10 



25 



40 



Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp L u Arg He Ser S r Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 



lie Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 

15 835 840 845 

He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 

850 855 860 

20 Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 

865 870 87S 880 



Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Met Leu Glu Gly Ser 

885 890 895 

Met Phe Val Lys Ser His His His Ser Leu He Ser Ser Thr Gin Gly 
900 905 910 



His Lys His Cys Gly Arg Pro Gin Gly Pro Leu His Arg Lys Thr Arg 

30 915 920 925 

Asp Leu Cys Ser Leu Val Tyr Leu Leu Thr Leu Pro Pro Leu Leu Ser 

930 935 940 

35 Tyr Asp Pro Ala Lys Ser Pro Ser Val Arg Asn Thr Gin Glu Ser He 

945 950 955 960 



Lys Lys Lys Lys Lys Lys Leu Glu Gly 
965 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 969 amino acids 
45 (B) TYPE: amino acid 

(C) S T RAND ED NESS : single 

(D) TOPOLOGY: linear 



<ii> MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met He Cy3 Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
1 S~ 10 15 

Tyr Val He Thr Ala Phe Aan Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 ' 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 - 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 
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Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 
275 260 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 
5 290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly lie Trp Ser 
305 310 315 320 

10 Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val lie Tyr Phe 

325 330 335 



15 



30 



45 



Pro Pro Lys lie Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

lie Tyr Lys Lys Glu Asn Lys lie Val Pro Ser Lys Glu lie Val Trp 
355 360 365 



Trp Met Asn Leu Ala Glu Lys lie Pro Gin Ser Gin Tyr Asp Val Val 

20 370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 

385 390 395 400 

25 Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 



Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn lie 

420 425 430 

Asn tie Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 

435 440 445 



Trp Ser Thr Ser Thr lie Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
35 450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 
465 470 475 480 

40 Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 



Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 



Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
50 530 535 540 



Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 
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Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin lie Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys val Pro Met 
6 25 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 
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He Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
835 840 845 

He lie Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 
850 855 860 

Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Met Leu Glu Gly Ser 
885 890 895 

Met Phe Val Lys Ser His His His Ser Leu He Ser Ser Thr Gin Gly 
900 905 910 

His Lys His Cys Gly Arg Pro Gin Gly Pro Leu His Arg Lys Thr Arg 
915 920 925 

Asp Leu Cys Ser Leu Val Tyr Leu Leu Thr Leu Pro Pro Leu Leu Ser 
930 935 940 

Tyr Asp Pro Ala Lys Ser Pro Ser Val Arg Asn Thr Gin Glu Ser He 
945 950 955 960 

Lys Lys Lys Lys Lys Lys Leu Glu Gly 
965 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 

15 10 15 

Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 

50 55 60 
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Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys lie Val 
275 280 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 
325 330 335 
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Pro Pro Lys lie Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

lie Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
5 355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 360 

10 Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 

385 390 395 400 



15 



30 



45 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 



Asn lie Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 

20 435 440 445 

Trp Ser Thr Ser Thr lie Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 

25 Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 

465 470 475 480 



Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 



He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 

35 515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 

530 535 540 

40 Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 

545 550 555 560 



Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 



Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 

50 595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 

610 615 620 



WO 97/25424 



PCT/US97/00128 



- 116 - 



Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser lie 
705 7 " 715 720 

Gly Ala Ser Val Ala Asn Phe Asn' Leu Thr Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 

He Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
835 840 845 

He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 
850 855 860 

Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Pro Glu Thr Phe Glu 
885 890 895 
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His Leu Phe lie Lys His Thr Ala Ser Val Thr Cys Gly Pro Leu Leu 
900 905 910 

Leu Glu Pro Glu Thr lie Ser Glu Asp lie Ser Val Asp Thr Ser Trp 
915 920 925 

Lys Asn Lys Asp Glu Met Met Pro Thr Thr Val Val Ser Leu Leu Ser 
930 935 940 

Thr Thr Asp Leu Glu Lys Gly Ser Val Cys lie Ser Asp Gin Phe Asn 
945 950 955 960 

Ser Val Asn Phe Ser Glu Ala Glu Gly Thr Glu Val Thr Tyr Glu Asp 
965 970 975 

Glu Ser Gin Arg Gin Pro Phe Val Lys Tyr Ala Thr Leu lie Ser Asn 
980 985 990 

Ser Lys Pro Ser Glu Thr Gly Glu Glu Gin Gly Leu lie Asn Ser Ser 
995 1000 1005 

Val Thr Lys Cys Phe Ser Ser Lys Asn Ser Pro Leu Lys Asp Ser Phe 
1010 1015 1020 

Ser Asn Ser Ser Trp Glu He' Glu Ala Gin Ala Phe Phe He Leu Ser 
1025 1030 1035 1040 

Asp Gin His Pro Asn He He Ser Pro His Leu Thr Phe Ser Glu Gly 
1045 1050 1055 

Leu Asp Glu Leu Leu Lys Leu Glu Gly Asn Phe Pro Glu Glu Asn Asn 
1060 .. 1065 1070 

Asp Lys Lys Ser He Tyr Tyr Leu Gly Val Thr Ser He Lys Lys Arg 
1075 1080 1085 

Glu Ser Gly Val Leu Leu Thr Asp Lys Ser Arg Val Ser Cys Pro Phe 
1090 1095 1100 

Pro Ala Pro Cys Leu Phe Thr Asp He Arg Val Leu Gin Asp Ser Cys 
1105 1110 1115 1120 

Ser His Phe Val Glu Asn Asn lie Asn Leu Gly Thr Ser Ser Lys Lys 
1125 1130 1135 

Thr Phe Ala Ser Tyr Met Pro Gin Phe Gin Thr Cys Ser Thr Gin Thr 
1140 1145 1150 

His Lys He Met Glu Asn Lys Met Cys Asp Leu Thr Val Phe His Arg 
1155 1160 1165 

Asn Leu Gin He Cys Val He Met Gly Asn He Lys Cys Asn Arg Leu 
1170 1175 1180 
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Leu Trp Val Gly Glu Arg Lys Glu Thr Arg Val Lys Phe Glu Asn Asn 
1185 H90 1195 1200 

Cys Ser Lys Lys Lys Lys Lys Lys Asn Ser Arg Pro Ala Arg Pro Asp 
1205 1210 1215 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 

TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 

AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 840 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 
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ATCAAGTGAA ATATTCAGAG AATTCTACAA 
CAGCTACATC CCTGCTAGTA GACAGTATAC 
5 GGGGCAAGAG ACTGGATGGC CCAGGAATCT 
CCACACAAGA TGTCATATAC TTTCCACCTA 
CTTTTCACTG CATCTATAAG AAGGAAAACA 

10 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA 
GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 
15 CAGTGTACTG CTGCAATGAA CATGAATGCC 
ATGTCAATAT CAATATCTCA TGTGAAACTG 
GGTCAACCAG TACAATCCAG TCACTTGCGG 

20 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
ATTTGCAGAG TGATGGTTTT TATGAATGCA 
25 ACACAATGTG GATTAGGATC AATCACTCTC 
TCCTTCCTGA 7TCTGTGGTG AAGCCACTGC 
TAAACATTGG ATT AT TG AAA ATATCTTGGG 

30 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
ATGATGCAAA ATCAAAATCT GTCAGTCTCC 
35 TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
CCTACACAGT TGTCATGGAT ATAAAAGTTC 
TTAATGGAGA TACTATGAAA AAGGAGAAAA 

40 

AAAATG ACT C ATTGTGCAGT GTTCAGAGAT 
GAACATGGTC AGAAGATGTG GGAAATCACA 
45 CACATACTGT TACGGTTCTG GCCATCAATT 
TAACCTTTTC ATGGCCTATG AGCAAAGTAA 
TAAACAGCAG TTGTGTGATT GTTTCCTGGA 

50 

ATTTTATTAT TGAGTGGAAA AATCTTAATG 
CTTCATCTGT TAAGAAGTAT TATATCCATG 



119 - 

CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960 

TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

AAATTCTGAC AAGTGTTGGG TCTAATGTTT 1140 

AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

ATCATCGCTA TGCTGAATTA TATGTGATTG 1380 

ATGGGTACTT AACT AAAATG ACTTGCAGAT 1440 

AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

TTCATCCCAT ATCTGAGCCC AAAGATTGCT 1560 

TTTTCCAGCC AATCTTCC T A TTATCTGGCT 1620 

TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

CTCCATCCAG TGTGAAAGCA G AAAT TACT A 17 40 

AAAAGCCAGT CTTTCCAGAG AATAACCTTC 1800 

AAGAAGT AC A ATGGAAGATG TATGAGGTTT 1860 

CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 

TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

CTATGAGAGG ACCTGAATTT TGGAGAATAA 2040 

ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 

ATGTGATAAA CCATCATACT TCCTGCAATG 2160 

CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 2220 

CAATTGGTGC TTCTGTTGCA AATT TTAATT 2280 

ATATCGTGCA GTCACTCAGT GCTTATCCTT 2340 

TACTATCACC CAGTGATTAC AAGC T AATGT 2400 

AAGATGGTGA AATAAAATGG CTTAGAATCT 24 60 

ATCATTTTAT CCCCATTGAG AAGTACCAGT 2520 
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TCAGTCTTTA CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 

CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT GTGCCAGTAA 

TTATTTCCTC TTCCATCTTA TTGCTTGGAA CATTATTAAT ATCACACCAA AGAATGAAAA 

AGCTATTTTG GGAAGATGTT CCGAACCCCA AGAATTGTTC CTGGGCACAA GGACTTAATT 

TTCAGAAGCC AGAAACGTTT GAGCATCTTT TTATCAAGCA TACAGCATCA GTGACATGTG 

GTCCTCTTCT TTTGGAGCCT GAAACAATTT CAGAAGATAT CAGTGTTGAT ACATCATGGA 

AAAATAAAGA TGAGATGATG CCAACAACTG TGGTCTCTCT ACTTTCAAC A ACAGATCTTG 

AAAAGGGTTC TGTTTGTATT AGTGACCAGT TCAACAGTGT TAACTTCTCT GAGGCTGAGG 

GTACTGAGGT AACCTATGAG GACGAAAGCC AGAGACAACC CTTTGTTAAA TACGCCACGC 

TGATCAGCAA CTCTAAACCA AGTGAAACTG GTGAAGAACA AGGGCTTATA AATAGTTCAG 

TCACCAAGTG CTTCTCTAGC AAAAATTCTC CGTTGAAGGA TTCTTTCTCT AATAGCTCAT 

GGGAGATAGA GGCCCAGGCA TTTTTTATAT TATCGGATCA GCATCCCAAC ATAATTTCAC 

CACACCTCAC ATTCTCAGAA GGATTGGATG AACTTTTGAA ATTGGAGGGA AATTTCCCTG 

AAGAAAATAA TGATAAAAAG TCTATCTATT ATTTAGGGGT CACCTCAATC AAAAAGAGAG 

AGAGTGGTGT GCTTTTGACT GACAAGTCAA GGGTATCGTG CCCATTCCCA GCCCCCTGTT 

TATTCACGGA CATCAGAGTT CTCCAGGACA GTTGCTCACA CTTTGTAGAA AATAATATCA 

ACTTAGGAAC TTCTAGTAAG AAGACTTTTG CATCTTACAT GCCTCAATTC CAAACTTGTT 

CTACTCAGAC TCATAAGATC ATGGAAAACA AGATGTGTGA CCTAACTGTG TAATCTAGA 

<2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



2560 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3599 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
NNNNNTACCT TTTCCAG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
1 5 10 15 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu. Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 110 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 



Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 
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Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
IBO 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 285 , 

Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 
325 330 335 

Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 
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10 



25 



40 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser lie His 
465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
465 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

lie Arg He Asn His Ser Leu Gly Ser lieu Asp Ser Pro Pro Thr Cys 
515 520 525 



Vai Leu Pro Asp Ser Val Vai Lys Pro Leu Pro Pro Ser Ser Val Lys 
15 530 535 540 

Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

20 Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 



Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 



Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
30 610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

35 Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 



Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 

660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 

675 680 685 



Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
45 690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 

705 710 715 720 



50 



Gly Ala Ser Val Ala Asn Phe Asn Leu 
725 



Thr Phe Ser Trp Pro Met Ser 
730 735 
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Lys Val Asn lie Val Gin Ser Leu 
740 



Ser Ala Tyr Pro Leu Asn Ser Ser 
745 750 



Cys Val lie Val Ser Trp lie Leu 
755 760 



Ser Pro Ser Asp Tyr Lys Leu Met 



765 



Tyr Phe lie He Glu Trp Lys Asn 
770 775 



Leu Asn Glu Asp Gly Glu He Lys 
780 



Trp Leu Arg He Ser Ser Ser Val 
785 790 



Lys Lys Tyr Tyr He His Asp His 
795 800 



Phe He Pro He Glu Lys Tyr Gin 
805 



Phe Ser Leu Tyr Pro He Phe Met 
810 .815 



Glu Gly Val Gly Lys Pro Lys He 
820 



He Asn Ser Phe Thr Gin Asp Asp 
825 830 



He Glu Lys His Gin Ser Asp 
835 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 4 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 
TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG T T ATGTGG AG TCATTATTTA * 540 
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AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

5 

GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

10 TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 840 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 

ATCAAGTGAA ATATTCAGAG AATTCTACAA CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960 

15 

CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGGGCAAGAG AC TGG ATGGC CCAGGAATCT GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

20 CCACACAAGA TGTCATATAC TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCTAATGTTT 1140 

CTTTTCACTG CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GGATGAATTT AG CTG AG AAA ATTCCTCAAA GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

25 

GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

CAGTGTACTG CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 1380 

30 ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACT AAAATG ACTTGCAGAT 1440 

GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC AAAGATTGCT 1560 

35 

ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT 1620 

ACACAATGTG GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

40 TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA 1740 

TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 1800 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG TATGAGGTTT 1860 

45 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 

TTCAGGTGCG CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

50 CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA 2040 

TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 
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AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT TCCTGCAATG 2160 

GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 2220 

CACATACTGT TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 2280 

TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT 2340 

TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 2400 

ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG CTTAGAATCT 2460 

CTTCATCTGT TAAGAAGTAT TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT 2520 

TCAGTCTTTA CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 2580 

CTCAAGATGA TATTGAAAAA CACCAGAGTG ATTGATAAGG ATCC 2624 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A J LENGTH: 294 8 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CCATTGAAGT 


CAATGGGAGT 


TTGTTTTGGC 


ACCAAAATCA 


ACGGGGATTT 


CCAAAATGTC 


60 


GTAATAACCC 


CGCCCCGTTG 


ACGCAAATGG 


GCGGTAGGCG 


TGTACGGTGG 


GAGGTCTATA 


120 


TAAGCAGAGC 


TCGTTTAGTG 


AACCGTCAGA 


TCTCTAGAAG 


CTGGGTACCA 


GCTGCTAGCA 


180 


AGCTTGCTAG 


CGGCCGCCAG 


TGTGATGGAT 


ATCTGCAGAA 


TTCGGCTTTC 


TCTGCCTTCG 


240 


GTCGAGTTGG 


ACCCCCGGAT 


CAAGGTGTAC 


TTCTCTGAAG 


TAAGATGATT 


TGTCAAAAAT 


300 


TCTGTGTGGT 


TTTGTTACAT 


TGGGAATTTA 


TTTATGTGAT 


AACTGCGTTT 


AACTTGTCAT 


360 


ATCCAAT T AC 


TCCTTGGAGA 


TTTAAGTTGT 


CTTGCATGCC 


ACCAAATTCA 


ACCTATGACT 


420 


ACTTCCTTTT 


GCCTGCTGGG 


CTCTCAAAGA 


ATACTTCAAA 


TTCGAATGGA 


CATTATGAGA 


480 


CAGCTGTTGA 


ACCTAAGTTT 


AATTCAAGTG 


GTACTCACTT 


TTCTAACTTA 


TCCAAAACAA 


540 


CTTTCCACTG 


TTGCTTTCGG 


AGTGAGCAAG 


ATAGAAACTG 


CTCCTTATGT 


GCAGACAACA 


600 
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TTGAAGGAAA GACATTTGTT TCAACAGTAA 
ACTGGAACAT ACAGTGCTGG CTAAAAGGAG 
5 CATTATTTAA GAATCTATTC AGGAATTATA 
CTGAAGTGTT AGAAGATTCA CCTCTGGTTC 
GCAATTGCAG TGTTCACGAA TGTTGTGAAT 

10 

ACGACAC TCT CCTTATGTGT TTGAAAATCA 
TAATGTCAGT TCAGCCCATA AATATGGTGA 
15 AAATCACAGA TGATGGTAAT TTAAAGATTT 
CACTTCAATA TCAAGTGAAA TATTCAGAGA 
AGATTGTCTC AGCTACATCC CTGCTAGTAG 

20 

TTCAGGTGAG GGGCAAGAGA CTGGATGGCC 
GTGTCTTTAC CACACAAGAT GTCATATACT 
25 CTAATGTTTC TTTTCACTGC ATCTATAAGA 
TTGTTTGGTG GATGAATTTA GCTGAGAAAA 
ATCATGTTAG CAAAGTTACT TTTTTCAATC 

30 

CCTATGATGC AGTGTACTGC TGCAATGAAC 
ATGTGATTGA TGTCAATATC AATATCTCAT 
35 CTTGCAGATG GTCAACCAGT ACAATCCAGT 
ATCATAGGAG CAGCCTTTAC TGTTCTGATA 
AAGATTGCTA TTTGCAGAGT GATGGTTTTT 

40 

TATCTGGCTA CACAATGTGG AT T AGGATCA 
CAACATGTGT CCTTCCTGAT TCTGTGGTGA 
45 AAATTACTAT AAACATTGGA TTATTGAAAA 
ATAACCTTCA ATTCCAGATT CGCTATGGTT 
ATGAGGTTTA TGATGCAAAA TCAAAATCTG 

50 

TCTATGCTGT TCAGGTGCGC TGTAAGAGGC 
GCAATCCAGC CTACACAGTT GTCATGGATA 
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ATTCTTTAGT TTTTCAACAA ATAGATGCAA 660 

ACTTAAAATT ATTCATCTGT TATGTGGAGT 720 

ACTATAAGGT CCATCTTTTA TATGTTCTGC 7 80 

CCCAAAAAGG CAGTTTTCAG ATGGTTCACT 84 0 

GTCTTGTGCC TGTGCCAACA GCCAAACTCA 900 

CATCTGGTGG AGTAATTTTC CAGTCACCTC 960 

AGCCTGATCC ACCATTAGGT TTGCATATGG 1020 

CTTGGTCCAG CCCACCATTG GTACCATTTC 1080 

ATTCTACAAC AGTTATCAGA GAAGCTGACA 1140 

ACAGTATACT TCCTGGGTCT TCGTATGAGG 1200 

CAGGAATCTG GAGTGACTGG AGTACTCCTC 1260 

TTCCACCTAA AATTCTGACA AGTGTTGGGT 1320 

AGGAAAACAA GATTGTTCCC TCAAAAGAGA 1380 

TTCCTCAAAG CCAGTATGAT GTTGTGAGTG 1440 

TGAATGAAAC CAAACCTCGA GGAAAGTTTA 1500 

ATGAATGCCA TCATCGCTAT GCTGAATTAT 1560 

GTG AAACTGA TGGGTACTTA AC T AAAATGA 1620 

CACTTGCGGA AAGCACTTTG CAATTGAGGT 1680 

TTCCATCTAT TCATCCCATA TCTGAGCCCA 1740 

ATGAATGCAT TTTCCAGCCA ATCTTCCTAT 1800 

ATCACTCTCT AGGTTCACTT GACTCTCCAC 1860 

AGCCACTGCC TCCATCCAGT GTGAAAGCAG 1920 

TATCTTGGGA AAAGCCAGTC TTTCCAGAGA 1980 

TAAGTGGAAA AGAAGTACAA TGGAAGATGT 2040 

TCAGTCTCCC AGTTCCAGAC TTGTGTGCAG 2100 

TAGATGGACT GGGATATTGG AGTAATTGGA 2160 

TAAAAGTTCC TATGAGAGGA CCTGAATTTT 2220 



WO 97/25424 



PCT/US97/00128 



- 128 - 

GGAGAATAAT TAATGGAGAT ACTATGAAAA AGGAGAAAAA 
CCCTGATGAA AAATGACTCA TTGTGCAGTG TTCAGAGATA 
CCTGCAATGG AACATGGTCA GAAGATGTGG GAAATCACAC 
CAGAGCAAGC ACATACTGTT ACGGTTCTGG CCATCAATTC 
ATTTTAATTT AACCTTTTCA TGGCCTATGA GCAAAGTAAA 
CTTATCCTTT AAACAGCAGT TGTGTGATTG TTTCCTGGAT 
AGCTAATGTA TTTTATTATT GAGTGGAAAA ATCTTAATGA 
TTAGAATCTC TTCATCTGTT AAGAAGTATT ATATCCATGA 
AGTACCAGTT CAGTCTTTAC CCAATATTTA TGGAAGGAGT 
ATAGTTTCAC TCAAGATGAT ATTGAAAAAC ACCAGAGTGA 
ACGATGACAA GTAGGGATCC AGACATGATA AG AT AC AT TG 
ACTAGAATGC AGTGAAAAAA ATGCTTTATT TGTGAAATTT 
GTAACCAT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 amino acids 
(3) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



TGTC AC TTT A 
TGTGATAAAC 
GAAATTCACT 
AATTGGTGCT 
TATCGTGCAG 
ACTATCACCC 
AGATGGTGAA 
TCATTTTATC 
GGGAAAACCA 
TGCAGGTGAC 
ATGAGTTTGG 
GTGATGCTAT 



CTTTGGAAGC 

CATCATACTT 

TTCCTGTGGA 

TCTGTTGCAA 

TCACTCAGTG 

AGTGATTACA 

ATAAAATGGC 

CCCATTGAGA 

AAGATAATTA 

TACAAGGACG 

ACAACCCACA 

TGCTTT ATT T 



2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

2948 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
1 5 io 15 

Tvr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 



35 



40 



45 



Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 
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10 



25 



40 



Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 110 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin lie Asp Ala Asn Trp Asn 
115 120 125 



lie Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe lie Cys Tyr Val 
15 130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

20 Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 



Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 



Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val lie Phe Gin Ser 

30 210 215 220 

Pro Leu Met Ser Val -Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

35 Leu Gly Leu His Met Glu lie Thr Asp Asp Gly Asn Leu Lys He Ser 

245 250 255 



Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 

260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 285 



Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
45 290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 



50 



Asp Trp Ser Thr Pro 

325 



Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 
330 335 
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Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

lie Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 

Ara Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 
465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

Glu Cys lie Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 " 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
5B0 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 
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10 



25 



Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg lie He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 



Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
15 690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala lie Asn Ser He 
705 710 715 720 

20 Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 



Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 



Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
30 770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Gly Lys 
785 790 795 800 

35 Phe Thr He Leu 



(2) INFORMATION FOR SEQ ID NO: 14: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2507 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: cDNA 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

5 GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

10 TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

15 GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCT T ATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 

TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 

20 

AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

25 GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 840 

30 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 

ATCAAGTGAA ATATTCAGAG AATTCTACAA CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960. 

35 CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGGGCAAGAG ACTGGATGGC CCAGGAATCT GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

CCACACAAGA TGTCATATAC TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCT AATGTTT 1140 

40 CTTTTCACTG CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

45 QCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

CAGTGTACTG CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 1380 

ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT 1440 

50 

GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

GCAGCCTTTA CTGT TCTG AT ATTCCATCTA TTCATCCCAT ATC TGAGCCC AAAGATTGCT 1560 
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ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT 1620 
ACACAATGTG GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

5 

TCCTTCCTGA T7CTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA 1740 
TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 1800 
10 AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG TATGAGGTTT 18 60 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 
TTCAGGTGCG CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

15 

CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA 2040 
TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 
20 AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT TCCTGCAATG 2160 
GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 2220 
CACATACTGT TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 2280 

25 

TAACCTTTTC A7GGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT 2340 
TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 2400 
30 ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG CTTAGAATCT 2460 
CTTCATCTGT TAAGAAGTAT TATATCCATG GTAAGTTTAC TATACTT 2507 
(2) INFORMATION FOR SEQ ID NO: 15: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTAAGTTATT TGNNNNNATA TCCTAACAG 29 

50 
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(2) INFORMATION FOR SEQ ID NO : 1 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTAAGCATTA GCNNNNNTTT TAAATTCAG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
GTAAGTACCA AANNNNN.TTT TCAATATAG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTAAGTTATG CANNNNNTTT TTCCTTAAG 



29 
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(2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GTAAGTATAT TTNNNNAATA TTTAACAG 28 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GTAGGTTATG TANNNNNCCC TCATTACAG 29 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GTAAGAAAAC AGNNNNNTGT TTCAAATAG 



29 
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(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GTACGTATTA TTNNNNNTAT CTTTTAAAG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GTATGTCAAG CTNNNNN AAA AATTTCTAG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTACCTTTTA CTNNNNNCTT ATTTTACAG 
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10 



15 



25 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 25: 
GTCTGCAGAG ATNNNNNGTC ATTTTGCAG 29 
(2) INFORMATION FOR SEQ ID NO:26: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE : nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTATTCCCAA TTNNNNNTAT TTACTACAG 2 9 

35 (2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
50 GTATTCCCAA TTNNNNNTAT TTACTACAG 



29 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTAAGTTTAC TANNNNNTTT TCTCCTCAG 
(2) INFORMATION FOR SEQ ID NO: 29: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GTAAAAATTA TANNNNNTTT CTTTTTCAG 
<2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
GTATTGTACT TGNNNNNTAT CCTTTGTAG 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTTGCTTTTT CANNNNNTTA TCTAAACAG 29 
(2) INFORMATION FOR SEQ ID NO : 32 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTACATTTGT CTNNNNNCTT TTCTTTTAG 29 
<2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GTATCCAGTG TTNNNNNCTT TTTAAACAG 



29 
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CLAIMS 

1. An OB receptor protein preparation 
5 containing an OB . receptor protein, optionally in a 

pharmaceutical^ acceptable formulation, said OB 
receptor protein having part or all of the amino acid 
sequence according to Seq. ID No. 1 and one or more of 
the biological properties of naturally occurring OB 
10 receptor protein. 

2. An OB receptor protein preparation 
containing an OB receptor protein, optionally in a 
pharmaceutical^ acceptable formulation, wherein said OB 

15 receptor protein amino acid sequence is selected from 

among amino acid sequences (according to Seq. ID No. 1) : 

(a) 1-896; 

(b) 22-896 optionally with an N-terminal 

methionyl residue; 
20 (c) 23-896 optionally with an N-terminal 

methionyl residue; 

(d) 29-896 optionally with an N-terminal 

methionyl residue; 

(e) 1-839; 

25 (f) 22-839 optionally with an N-terminal 

methionyl residue; 

(g) 29-839 optionally with an N-terminal 

methionyl residue; 

(h) 1-841; 

30 (i) 22-841 optionally with an N-terminal 

methionyl residue; 

(j) 23-841 optionally with an N-terminal 

methionyl residue; 

(k) 29-841 optionally with an N-terminal 

35 methionyl residue; 

(1) 1-891; 
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(m) 22-891 optionally with an N-terminal 
methionyl residue; 

(n) 23-891 optionally with an N-terminal 
methionyl residue; 
5 (o) 29-891 optionally with an N-terminal 

methionyl residue; 

(p) of subparts (1) through (o) further 
having the C-terminal amino acids, beginning at position 
892, of OB receptor B (Seq. ID No. 3) or C (Seq. ID. No. 
10 5) ; and, 

(q) a chemically modified derivative of 
any of subparts (a) through (p) . 

3. An OB receptor protein preparation of 
15 claim 2 wherein said OB receptor protein is further 

selected from among the OB receptor proteins of subparts 
(1) through (o) further having the C-terminal amino 
acids, beginning at position 8 92, of OB receptor protein 
D (Seq. ID No. 7) . 

20 

4. An OB receptor protein preparation of 
claim 2 wherein said OB receptor protein is further 
selected from among the OB receptor proteins of subparts 
(1) through (o) further having substituted the C- 

25 terminal amino acids, beginning at .position 799, G K F T 
I L (Seq. ID No. 13) . 



WO 97/25424 



PCT/US97/00128 



- 142 - 



5. An OB receptor protein preparation 
according to any of claims 1 through 4, wherein the 
extracellular domain of said OB receptor protein is 
5 modified, said modification selected from among: 

(a) deletion of all or part of the 

random coil domain; 

(b) modification of one or both ~WSXWS" 
boxes by substition of the first serine with another 

10 amino acid; 

(c) modification of one or both ^WSXWS" 
boxes by substitution of the last serine with another 
amino acid; and 

(d) modification of one or both "WSXWS" 
15 boxes by substitution of the first tryptophan with 

another amino acid. 

6. A DNA molecule encoding an OB receptor 
protein according to any of claims 1-5 selected from the 

20 group consisting of: 

(a) the DNA sequences set forth in Seq. 

ID nos. 2, 4, 6, 8, 11, 12, and 14; 

(b) a DNA which selectively hybridizes 

to a DNA of subpart (a) ; and 
25 (c) a DNA which, but for the degeneracy 

of the genetic code would hybridize to a DNA of subpart 
(a) or (b) . 

7. A biologically functional viral or 
30 plasmid vector containing a DNA of claim 6. 

8, A procaryotic or eucaryotic host cell 
containing the vector of claim 7 . . 

35 9. A host cell modified so that expression 

of endogenous OB receptor protein is enhanced. 
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10. A host cell of claim 9 which is an 
isolated human host cell. 

5 11. A process for producing an OB receptor 

protein comprised of culturing, under suitable 
conditions, a host cell according to any of claims 8, 9 
or 10, obtaining the OB receptor produced, and 
optionally preparing a pharmaceutical composition 
10 containing said OB receptor. 

12. A method of treating an individual for a 
therapeutic disorder selected from among obesity, 
diabetes, high blood lipid levels, and high cholesterol 
15 levels comprised of administering a therapeutic amount 
of an OB receptor protein preparation containing an OB 
receptor protein according to any of claims 1-5, or 
produced by the process according to claim 11. 

20 13. A method of treating an individual for 

weight loss or weight maintenance for solely cosmetic 
purposes comprised of administering" an effective amount 
of an OB receptor preparation containing an OB receptor 
protein according to any of claims 1-5, or produced by 

25 the process according to claim 11. 

14. Use of an OB receptor protein according 
to claims 1-5, or produced by the process of claim 11, 
for manufacturing a medicament for the treatment of 
30 obesity, diabetes, high blood lipid levels, or high 
cholesterol levels . 
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15. An OB protein/OB receptor protein complex 
preparation, containing an OB protein moiety and an OB 
receptor protein moiety, optionally in a pharmaceutic 
5 cally acceptable formulation, wherein: 

(a) said OB receptor protein is 
selected from among those set forth in any of claims 1 
and 2; 

(b) said OB protein moiety is selected 

10 from among: 

(i) a naturally ocurring OB 

protein; and, 

(ii) a non-naturally ocurring OB 
protein, analog or derivative thereof. 

15 

16. An OB protein/OB receptor protein complex 
preparation of claim 15 wherein said OB receptor protein 
is selected from among those set forth in any of claims 
3, 4, and 5. 

20 

17. A method of treating an individual for a 
therapeutic disorder selected from among obesity, 
diabetes, high blood lipid levels, and high cholesterol 
levels comprised of administering a therapeutic amount 

25 of an OB protein/OB receptor protein complex preparation 
of claims 15 or 16. 

18. A method of claim 17 wherein said OB 
protein/OB receptor protein complex preparation is 

30 formed in vivo by administering, into a patient,, a first 
population of cells expressing an OB protein, and a 
second population of cells expressing an OB receptor 
protein. 
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19. A method of treating an individual, for 
weight loss or weight maintenance for solely cosmetic 
purposes comprised of administering a therapeutic amount 
5 of an OB protein/OB receptor protein complex preparation 
containing an OB receptor protein moiety according to 
any of claims 1-5, or produced by the process according 
to claim 11. 

10 20. Use of an OB protein/OB receptor protein 

complex preparation, according to claims 15 or 16, for 
manufacturing a medicament for the treatment of obesity, 
diabetes, high blood lipid levels, or high cholesterol 
levels. 
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